HELICOBACTER POLYPEPTIDES AND CORRESPONDING
POLYNUCLEOTIDE MOLECULES

The invention relates to Helicobacter antigens and corresponding
polynucleotide molecules that can be used in methods to prevent or treat
Helicobacter infection in mammals, such as humans.



Background of the Invention 
Helicobacter is a genus of spiral, gram-negative bacteria that
colonize the gastrointestinal tracts of mammals. Several species colonize the
stomach, most notably H. pylori, H. heilmanii, H. felis, and H. mustelae.
Although H. pylori is the species most commonly associated with human
infection, H. heilmanii and H. felis have also been isolated from humans, but at
lower frequencies than H. pylori. Helicobacter infects over 50% of adult
populations in developed countries and nearly 100% in developing countries
and some Pacific rim countries, making it one of the most prevalent infections
worldwide.

Helicobacter is routinely recovered from gastric biopsies of humans
with histological evidence of gastritis and peptic ulceration. Indeed, H. pylori
is now recognized as an important pathogen of humans, in that the chronic
gastritis it causes is a risk factor for the development of peptic ulcer diseases
and gastric carcinoma. It is thus highly desirable to develop safe and effective
vaccines for preventing and treating Helicobacter infection.

A number of Helicobacter antigens have been characterized or
isolated. These include urease, which is composed of two structural subunits of
approximately 30 and 67 kDa (Hu et al., Infect. Immun. 58:992, 1990; Dunn et
al., J. Biol. Chem. 265:9464, 1990; Evans et al., Microbial Pathogenesis 10:15,
1991; Labigne et al., J. Bact. 173:1920, 1991); the 87 kDa vacuolar cytotoxin
(VacA) (Cover et al., J. Biol. Chem. 267:10570, 1992; Phadnis et al., Infect.
Immun. 62:1557, 1994; WO 93/18150); a 128 kDa immunodominant antigen
associated with the cytotoxin (CagA, also called TagA; WO 93/18150; U.S.
Patent No. 5,403,924); 13 and 58 kDa heat shock proteins HspA and HspB
(Suerbaum et al., Mol. Microbiol. 14:959, 1994; WO 93/18150); a 54 kDa
catalase (Hazell et al., J. Gen. Microbiol. 137:57, 1991); a 15 kDa histidine-rich
protein (Hpn) (Gilbert et al., Infect. Immun. 63:2682, 1995); a 20 kDa
membrane-associated lipoprotein (Kostrzynska et al., J. Bact. 176:5938, 1994);
a 30 kDa outer membrane protein (Bolin et al., J. Clin. Microbiol. 33:381,
1995); a lactoferrin receptor (FR 2,724,936); and several porins, designated
HopA, HopB, HopC, HopD, and HopE, which have molecular weights of
48-67 kDa (Exner et al., Infect. Immun. 63:1567, 1995; Doig et al., J. Bact.
177:5447, 1995). Some of these proteins have been proposed as potential
vaccine antigens. In particular, urease is believed to be a vaccine candidate
(WO 94/9823; WO 95/22987; WO 95/3824; Michetti et al., Gastroenterology
107:1002, 1994). Nevertheless, it is thought that several antigens may
ultimately be necessary in a vaccine.

Summary of the Invention

The invention provides polynucleotide molecules that encode
Helicobacter polypeptides, designated GHPO 13, GHPO 73, GHPO 90, GHPO
107, GHPO 136, GHPO 191, GHPO 213, GHPO 240, GHPO 408, GHPO 411,
GHPO 419, GHPO 431, GHPO 474, GHPO 591, GHPO 596, GHPO 699,
GHPO 724, GHPO 730, GHPO 761, GHPO 804, GHPO 805, GHPO 812,
GHPO 879, GHPO 888, GHPO 986, GHPO 1056, GHPO 1081, GHPO 1100,
GHPO 1140, GHPO 1148, GHPO 1200, GHPO 1212, GHPO 1258, GHPO
1263, GHPO 1273, GHPO 1284, GHPO 1299, GHPO 1327, GHPO 1346,
GHPO 1378, GHPO 1412, GHPO 1443, GHPO 1466, GHPO 1476, GHPO
1536, GHPO 1559, GHPO 427, GHPO 1045, GHPO 1262, GHPO 1688,
GHPO 1538, GHPO 346, GHPO 1012, GHPO 470, GHPO 1398, GHPO 1550,
GHPO 276, GHPO 1501, GHPO 706, GHPO 1001, GHPO 732, GHPO 329,
GHPO 574, GHPO 1190, GHPO 1374, GHPO 1620, GHPO 956, HPO 98,
GHPO 689, GHPO 208, GHPO 296, GHPO 726, GHPO 1026, GHPO 1301,
GHPO 1536, GHPO 166, GHPO 253, GHPO 297, GHPO 615, GHPO 1278,
GHPO 1282, GHPO 1420, GHPO 1484, GHPO 1719, and GHPO 1252, which
can be used, e.g., in methods to prevent, treat, or diagnose Helicobacter
infection. The polypeptides of the invention include those having the amino
acid sequences shown in SEQ ID NOs:2-170 (even numbers), as well as mature
forms of proteins having sequences shown in SEQ ID NOs:2-170 in their
unprocessed forms, and fragments thereof. Those skilled in the art will
understand that the invention also includes polynucleotide molecules that
encode mutants and derivatives of these polypeptides, which can result from
the addition, deletion, or substitution of non-essential amino acids, as is
described further below.

In addition to the polynucleotide molecules described above, the
invention includes the corresponding polypeptides (i.e., polypeptides encoded
by the polynucleotide molecules of the invention, or fragments thereof), and
monospecific antibodies that specifically bind to these polypeptides.

The present invention has many applications and includes expression
cassettes, vectors, and cells transformed or transfected with the polynucleotides
of the invention. Accordingly, the present invention provides (i) methods for
producing polypeptides of the invention in recombinant host systems and
related expression cassettes, vectors, and transformed or transfected cells; (ii)
live vaccine vectors, such as pox virus, Salmonella typhimurium, and Vibrio
cholerae vectors, that contain polynucleotides of the invention (such vaccine
vectors being useful in, e.g., methods for preventing or treating Helicobacter
infection) in combination with a diluent or carrier, and related pharmaceutical
compositions and associated therapeutic and/or prophylactic methods; (iii)
therapeutic and/or prophylactic methods involving administration of
polynucleotide molecules, either in a naked form or formulated with a delivery
vehicle, polypeptides or mixtures of polypeptides, or monospecific antibodies
of the invention, and related pharmaceutical compositions; (iv) methods for
detecting the presence of Helicobacter in biological samples, which can
involve the use of polynucleotide molecules, monospecific antibodies, or
polypeptides of the invention; and (v) methods for purifying polypeptides of
the invention by antibody-based affinity chromatography.

Brief Description of the Drawings

Fig. 1A is a diagrammatic representation of transposon TnMax9,
which is a derivative of the TnMax transposon system (Haas et al., Gene
130:23-31, 1993). The mini-transposon carries the blaM gene, which is the
β-lactamase gene lacking a promoter and a signal sequence, next to the inverted
repeats (IR) and the M13 forward (M13-FP) and reverse (M13-RP1) primer
binding sites. The resolution site (res) and an origin of replication (ori_fd) are
located between the blaM gene and the constitutive cat_GC resistance gene. The
transposase tnpA and resolvase tnpR genes are located outside of the
mini-transposon and are under the control of the inducible P_trc promoter. The
lacIq gene encodes the Lac repressor.

Fig. 1B is a diagrammatic representation of plasmid pMin2. pMin2
contains a multiple cloning site, the tetracycline resistance gene (tet), an origin
of transfer (oriT), an origin of replication (ori_ColE1), a transcriptional
terminator (t_fd), and a weak, constitutive promoter (P_iga). H. pylori
chromosome fragments were introduced into the BglII and ClaI sites of pMin2.



Detailed Description 
Open reading frames (ORFs) encoding new, full-length polypeptides,
designated GHPO 13, GHPO 73, GHPO 90, GHPO 107, GHPO 136, GHPO
191, GHPO 213, GHPO 240, GHPO 408, GHPO 411, GHPO 419, GHPO 431,
GHPO 474, GHPO 591, GHPO 596, GHPO 699, GHPO 724, GHPO 730,
GHPO 761, GHPO 804, GHPO 805, GHPO 812, GHPO 879, GHPO 888,
GHPO 986, GHPO 1056, GHPO 1081, GHPO 1100, GHPO 1140, GHPO
1148, GHPO 1200, GHPO 1212, GHPO 1258, GHPO 1263, GHPO 1273,
GHPO 1284, GHPO 1299, GHPO 1327, GHPO 1346, GHPO 1378, GHPO
1412, GHPO 1443, GHPO 1466, GHPO 1476, GHPO 1536, GHPO 1559,
GHPO 427, GHPO 1045, GHPO 1262, GHPO 1688, GHPO 1538, GHPO 346,
GHPO 1012, GHPO 470, GHPO 1398, GHPO 1550, GHPO 276, GHPO 1501,
GHPO 706, GHPO 1001, GHPO 732, GHPO 329, GHPO 574, GHPO 1190,
GHPO 1374, GHPO 1620, GHPO 956, HPO 98, GHPO 689, GHPO 208,
GHPO 296, GHPO 726, GHPO 1026, GHPO 1301, GHPO 1536, GHPO 166,
GHPO 253, GHPO 297, GHPO 615, GHPO 1278, GHPO 1282, GHPO 1420,
GHPO 1484, GHPO 1719, and GHPO 1252, have been identified in the H.
pylori genome. These polypeptides can be used, for example, in vaccination
methods for preventing or treating Helicobacter infection. Some of the new
polypeptides are secreted polypeptides that can be produced in their mature
forms (i.e., as polypeptides that have been exported through class II or class III
secretion pathways) or as precursors that include signal peptides, which can be
removed in the course of excretion/secretion by cleavage at the N-terminal end
of the mature form. (The cleavage site is located at the C-terminal end of the
signal peptide, adjacent to the mature form.)

According to a first aspect of the invention, there are provided
isolated polynucleotides that encode the precursor and mature forms of
Helicobacter GHPO 13, GHPO 73, GHPO 90, GHPO 107, GHPO 136, GHPO
191, GHPO 213, GHPO 240, GHPO 408, GHPO 411, GHPO 419, GHPO 431,
GHPO 474, GHPO 591, GHPO 596, GHPO 699, GHPO 724, GHPO 730,
GHPO 761, GHPO 804, GHPO 805, GHPO 812, GHPO 879, GHPO 888,
GHPO 986, GHPO 1056, GHPO 1081, GHPO 1100, GHPO 1140, GHPO
1148, GHPO 1200, GHPO 1212, GHPO 1258, GHPO 1263, GHPO 1273,
GHPO 1284, GHPO 1299, GHPO 1327, GHPO 1346, GHPO 1378, GHPO
1412, GHPO 1443, GHPO 1466, GHPO 1476, GHPO 1536, GHPO 1559,
GHPO 427, GHPO 1045, GHPO 1262, GHPO 1688, GHPO 1538, GHPO 346,
GHPO 1012, GHPO 470, GHPO 1398, GHPO 1550, GHPO 276, GHPO 1501,
GHPO 706, GHPO 1001, GHPO 732, GHPO 329, GHPO 574, GHPO 1190,
GHPO 1374, GHPO 1620, GHPO 956, HPO 98, GHPO 689, GHPO 208,
GHPO 296, GHPO 726, GHPO 1026, GHPO 1301, GHPO 1536, GHPO 166,
GHPO 253, GHPO 297, GHPO 615, GHPO 1278, GHPO 1282, GHPO 1420,
GHPO 1484, GHPO 1719, and GHPO 1252.

An isolated polynucleotide of the invention encodes:
(i) a polypeptide having an amino acid sequence that is homologous
to a Helicobacter amino acid sequence of a polypeptide, the Helicobacter
amino acid sequence being selected from the group consisting of the amino
acid sequences shown in SEQ ID NO:2 (GHPO 13), SEQ ID NO:4 (GHPO 73),
SEQ ID NO:6 (GHPO 90), SEQ ID NO:8 (GHPO 107), SEQ ID NO:10
(GHPO 136), SEQ ID NO:12 (GHPO 191), SEQ ID NO:14 (GHPO 213), SEQ
ID NO:16 (GHPO 240), SEQ ID NO:18 (GHPO 408), SEQ ID NO:20 (GHPO
411), SEQ ID NO:22 (GHPO 419), SEQ ID NO:24 (GHPO 431), SEQ ID
NO:26 (GHPO 474), SEQ ID NO:28 (GHPO 591), SEQ ID NO:30 (GHPO
596), SEQ ID NO:32 (GHPO 699), SEQ ID NO:34 (GHPO 724), SEQ ID
NO:36 (GHPO 730), SEQ ID NO:38 (GHPO 761), SEQ ID NO:40 (GHPO
804), SEQ ID NO:42 (GHPO 805), SEQ ID NO:44 (GHPO 812), SEQ ID
NO:46 (GHPO 879), SEQ ID NO:48 (GHPO 888), SEQ ID NO:50 (GHPO
986), SEQ ID NO:52 (GHPO 1056), SEQ ID NO:54 (GHPO 1081), SEQ ID
NO:56 (GHPO 1100), SEQ ID NO:58 (GHPO 1140), SEQ ID NO:60 (GHPO
1148), SEQ ID NO:62 (GHPO 1200), SEQ ID NO:64 (GHPO 1212), SEQ ID
NO:66 (GHPO 1258), SEQ ID NO:68 (GHPO 1263), SEQ ID NO:70 (GHPO
1273), SEQ ID NO:72 (GHPO 1284), SEQ ID NO:74 (GHPO 1299), SEQ ID
NO:76 (GHPO 1327), SEQ ID NO:78 (GHPO 1346), SEQ ID NO:80 (GHPO
1378), SEQ ID NO:82 (GHPO 1412), SEQ ID NO:84 (GHPO 1443), SEQ ID
NO:86 (GHPO 1466), SEQ ID NO:88 (GHPO 1476), SEQ ID NO:90 (GHPO
1536), SEQ ID NO:92 (GHPO 1559), SEQ ID NO:94 (GHPO 427), SEQ ID
NO:96 (GHPO 1045), SEQ ID NO:98 (GHPO 1262), SEQ ID NO:100 (GHPO
1688), SEQ ID NO:102 (GHPO 1538), SEQ ID NO:104 (GHPO 346), SEQ ID
NO:106 (GHPO 1012), SEQ ID NO:108 (GHPO 470), SEQ ID NO:110
(GHPO 1398), SEQ ID NO:112 (GHPO 1550), SEQ ID NO:114 (GHPO 276),
SEQ ID NO:116 (GHPO 1501), SEQ ID NO:118 (GHPO 706), SEQ ID
NO:120 (GHPO 1001), SEQ ID NO:122 (GHPO 732), SEQ ID NO:124
(GHPO 329), SEQ ID NO:126 (GHPO 574), SEQ ID NO:128 (GHPO 1190),
SEQ ID NO:130 (GHPO 1374), SEQ ID NO:132 (GHPO 1620), SEQ ID
NO:134 (GHPO 956), SEQ ID NO:136 (HPO 98), SEQ ID NO:138 (GHPO
689), SEQ ID NO:140 (GHPO 208), SEQ ID NO:142 (GHPO 296), SEQ ID
NO:144 (GHPO 726), SEQ ID NO:146 (GHPO 1026), SEQ ID NO:148
(GHPO 1301), SEQ ID NO:150 (GHPO 1536), SEQ ID NO:152 (GHPO 166),
SEQ ID NO:154 (GHPO 253), SEQ ID NO:156 (GHPO 297), SEQ ID NO:158
(GHPO 615), SEQ ID NO:160 (GHPO 1278), SEQ ID NO:162 (GHPO 1282),
SEQ ID NO:164 (GHPO 1420), SEQ ID NO:166 (GHPO 1484), SEQ ID
NO:168 (GHPO 1719), and SEQ ID NO:170 (GHPO 1252); or
(ii) a derivative of the polypeptide.

In addition to the full-length polypeptides encoded by the
polynucleotides of the invention, as set forth above, polynucleotides included in
the invention can also encode polypeptides that lack signal sequences, as well
as other polypeptide or peptide fragments of the full-length polypeptides.

The term "isolated polynucleotide" is defined as a polynucleotide
that is removed from the environment in which it naturally occurs. For
example, a naturally-occurring DNA molecule present in the genome of a
living bacterium or as part of a gene bank is not isolated, but the same molecule,
separated from the remaining part of the bacterial genome, as a result of, e.g., a
cloning event (amplification), is "isolated." Typically, an isolated DNA
molecule is free from DNA regions (e.g., coding regions) with which it is
immediately contiguous, at the 5' or 3' ends, in the naturally occurring genome.
Such isolated polynucleotides can be part of a vector or a composition and still
be isolated, as such a vector or composition is not part of its natural
environment.

A polynucleotide of the invention can consist of RNA or DNA (e.g.,
cDNA, genomic DNA, or synthetic DNA), or modifications or combinations of
RNA or DNA. The polynucleotide can be double-stranded or single-stranded
and, if single-stranded, can be the coding (sense) strand or the non-coding
(anti-sense) strand. The sequences that encode polypeptides of the invention, as
shown in any of SEQ ID NOs:2-170 (even numbers), can be (a) the coding
sequence as shown in any of SEQ ID NOs:1-169 (odd numbers); (b) a
ribonucleotide sequence derived by transcription of (a); or (c) a different
coding sequence that, as a result of the redundancy or degeneracy of the genetic
code, encodes the same polypeptides as the polynucleotide molecules having
the sequences illustrated in any of SEQ ID NOs:1-169 (odd numbers). The
polypeptide can be one that is naturally secreted or excreted by, e.g., H. felis,
H. mustelae, H. heilmanii, or H. pylori.

By "polypeptide" or "protein" is meant any chain of amino acids,
regardless of length or post-translational modification (e.g., glycosylation or
phosphorylation). Both terms are used interchangeably in the present
application.

By "homologous amino acid sequence" is meant an amino acid
sequence that differs from an amino acid sequence shown in any of SEQ ID
NOs:2-170 (even numbers), or an amino acid sequence encoded by the
nucleotide sequence of any of SEQ ID NOs:1-169 (odd numbers), by one or
more non-conservative amino acid substitutions, deletions, or additions located
at positions at which they do not destroy the specific antigenicity of the
polypeptide. Preferably, such a sequence is at least 75%, more preferably at
least 80%, and most preferably at least 90% identical to an amino acid
sequence shown in any of SEQ ID NOs:2-170 (even numbers).

Homologous amino acid sequences include sequences that are
identical or substantially identical to an amino acid sequence as shown in any
of SEQ ID NOs:2-170 (even numbers). By "amino acid sequence that is
substantially identical" is meant a sequence that is at least 90%, preferably at
least 95%, more preferably at least 97%, and most preferably at least 99%
identical to an amino acid sequence of reference and that differs from the
sequence of reference, if at all, by a majority of conservative amino acid
substitutions.

Conservative amino acid substitutions typically include substitutions
among amino acids of the same class. These classes include, for example,
amino acids having uncharged polar side chains, such as asparagine, glutamine,
serine, threonine, and tyrosine; amino acids having basic side chains, such as
lysine, arginine, and histidine; amino acids having acidic side chains, such as
aspartic acid and glutamic acid; and amino acids having nonpolar side chains,
such as glycine, alanine, valine, leucine, isoleucine, proline, phenylalanine,
methionine, tryptophan, and cysteine.
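
The class-based definition of a conservative substitution can be illustrated
with a short Python sketch. The four classes and their members are taken from
the paragraph above; the names CLASSES, RESIDUE_CLASS, and is_conservative are
hypothetical and introduced only for illustration.

```python
# Side-chain classes as listed in the text (one-letter amino acid codes).
CLASSES = {
    "uncharged polar": "NQSTY",   # Asn, Gln, Ser, Thr, Tyr
    "basic": "KRH",               # Lys, Arg, His
    "acidic": "DE",               # Asp, Glu
    "nonpolar": "GAVLIPFMWC",     # Gly, Ala, Val, Leu, Ile, Pro, Phe, Met, Trp, Cys
}

# Map each residue to the name of its class.
RESIDUE_CLASS = {aa: name for name, members in CLASSES.items() for aa in members}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if both residues belong to the same side-chain class."""
    a, b = original.upper(), replacement.upper()
    if a not in RESIDUE_CLASS or b not in RESIDUE_CLASS:
        raise ValueError(f"unknown residue: {original!r} or {replacement!r}")
    return RESIDUE_CLASS[a] == RESIDUE_CLASS[b]
```

Under this grouping, a lysine-to-arginine change counts as conservative, while
an aspartic acid-to-lysine change does not.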

Homology can be measured using sequence analysis software (e.g.,
Sequence Analysis Software Package of the Genetics Computer Group,
University of Wisconsin Biotechnology Center, 1710 University Avenue,
Madison, WI 53705). Similar amino acid sequences are aligned to obtain the
maximum degree of homology (i.e., identity). To this end, it may be necessary
to artificially introduce gaps into the sequence. Once the optimal alignment has
been set up, the degree of homology (i.e., identity) is established by recording
all of the positions in which the amino acids of both sequences are identical,
relative to the total number of positions.
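
The identity calculation described above can be sketched as follows, assuming
that the two sequences have already been optimally aligned by other software
and that gaps are written as "-". The function name percent_identity and the
gap character are assumptions made for illustration; this is not the algorithm
of the software package named above.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of alignment positions at which both residues are identical.

    Both inputs must be pre-aligned and therefore of equal length. The
    denominator is the total number of alignment positions, gaps included.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    if not aligned_a:
        raise ValueError("aligned sequences must not be empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)
```

For example, an alignment of "MKT-LV" against "MKTALV" has five identical
positions out of six, i.e., about 83% identity.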

Homologous polynucleotide sequences are defined in a similar way.
Preferably, a homologous sequence is one that is at least 45%, more preferably
at least 60%, and most preferably at least 85% identical to a coding sequence of
any of SEQ ID NOs:1-169 (odd numbers).

Polypeptides having a sequence homologous to any one of the
sequences shown in SEQ ID NOs:2-170 (even numbers) include
naturally-occurring allelic variants, as well as mutants or any other
non-naturally occurring variants that are analogous, in terms of antigenicity,
to a polypeptide having a sequence as shown in any one of SEQ ID NOs:2-170
(even numbers).

As is known in the art, an allelic variant is an alternate form of a
polypeptide that is characterized as having a substitution, deletion, or addition
of one or more amino acids that does not alter the biological function of the
polypeptide. By "biological function" is meant a function of the polypeptide in
the cells in which it naturally occurs, even if the function is not necessary for
the growth or survival of the cells. For example, the biological function of a
porin is to allow the entry into cells of compounds present in the extracellular
medium. The biological function is distinct from the antigenic function. A
polypeptide can have more than one biological function.

Allelic variants are very common in nature. For example, a bacterial
species, e.g., H. pylori, is usually represented by a variety of strains that differ
from each other by minor allelic variations. Indeed, a polypeptide that fulfills
the same biological function in different strains can have an amino acid
sequence that is not identical in each of the strains. Such an allelic variation
can be equally reflected at the polynucleotide level.

Support for the use of allelic variants of polypeptide antigens comes
from, e.g., studies of the Helicobacter urease antigen. The amino acid
sequence of Helicobacter urease varies widely from species to species, yet
cross-species protection occurs, indicating that the urease molecule, when used
as an immunogen, is highly tolerant of amino acid variations. Even among
different strains of the single species H. pylori, there are amino acid sequence
variations.

For example, although the amino acid sequences of the UreA and
UreB subunits of H. pylori and H. felis ureases differ from one another by
26.5% and 11.8%, respectively (Ferrero et al., Molecular Microbiology
9(2):323-333, 1993), it has been shown that H. pylori urease protects mice from
H. felis infection (Michetti et al., Gastroenterology 107:1002, 1994). In
addition, it has been shown that the individual structural subunits of urease,
UreA and UreB, which contain distinct amino acid sequences, are both
protective antigens against Helicobacter infection (Michetti et al., supra).
Similarly, Cuenca et al. (Gastroenterology 110:1770, 1996) showed that
therapeutic immunization of H. mustelae-infected ferrets with H. pylori urease
was effective at eradicating H. mustelae infection. Further, several urease
variants have been reported to be effective vaccine antigens, including, e.g.,
recombinant UreA + UreB apoenzyme expressed from pORV142 (UreA and
UreB sequences derived from H. pylori strain CPM630; Lee et al., J. Infect.
Dis. 172:161, 1995); recombinant UreA + UreB apoenzyme expressed from
pORV214 (UreA and UreB sequences differ from H. pylori strain CPM630 by
one and two amino acid changes, respectively; Lee et al., supra, 1995); a
UreA-glutathione-S-transferase fusion protein (UreA sequence from H. pylori
strain ATCC 43504; Thomas et al., Acta Gastro-Enterologica Belgica 56:54,
1993); UreA + UreB holoenzyme purified from H. pylori strain NCTC11637
(Marchetti et al., Science 267:1655, 1995); a UreA-MBP fusion protein (UreA
from H. pylori strain 85P; Ferrero et al., Infection and Immunity 62:4981,
1994); a UreB-MBP fusion protein (UreB from H. pylori strain 85P; Ferrero et
al., supra); a UreA-MBP fusion protein (UreA from H. felis strain ATCC
49179; Ferrero et al., supra); a UreB-MBP fusion protein (UreB from H. felis
strain ATCC 49179; Ferrero et al., supra); and a 37 kDa fragment of UreB
containing amino acids 220-569 (Dore-Davin et al., "A 37 kD fragment of
UreB is sufficient to confer protection against Helicobacter felis infection in
mice"). Finally, Thomas et al. (supra) showed that oral immunization of mice
with crude sonicates of H. pylori protected mice from subsequent challenge
with H. felis.

Polynucleotides, e.g., DNA molecules, encoding allelic variants can
easily be obtained by polymerase chain reaction (PCR) amplification of
genomic bacterial DNA extracted by conventional methods. This involves the
use of synthetic oligonucleotide primers matching sequences that are upstream
and downstream of the 5' and 3' ends of the coding region. Suitable primers
can be designed based on the nucleotide sequence information provided in any
of SEQ ID NOs:1-169 (odd numbers). Typically, a primer consists of 10 to 40,
preferably 15 to 25, nucleotides. It can also be advantageous to select primers
containing C and G nucleotides in proportions sufficient to ensure efficient
hybridization, e.g., an amount of C and G nucleotides of at least 40%,
preferably 50%, of the total nucleotide amount. Those skilled in the art can
readily design primers that can be used to isolate the polynucleotides of the
invention from different Helicobacter strains. Experimental conditions for
carrying out PCR can readily be determined by one skilled in the art and an
illustration of carrying out PCR is provided in the Examples below. As is well
known in the art, restriction endonuclease recognition sites that contain,
typically, 4 to 6 nucleotides (for example, the sequences 5'-GGATCC-3'
(BamHI) or 5'-CTCGAG-3' (XhoI)), can be included on the 5' ends of the
primers. Restriction sites can be selected by those skilled in the art so that the
amplified DNA can be conveniently cloned into an appropriately digested
vector, such as a plasmid.
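
The primer criteria given above (10 to 40 nucleotides, a C+G content of at
least 40%, and an optional restriction site on the 5' end) can be checked with
a minimal Python sketch. The function names and the example primer are
hypothetical; only the length range, the C+G threshold, and the BamHI and XhoI
recognition sequences come from the text.

```python
BAMHI_SITE = "GGATCC"  # 5'-GGATCC-3'
XHOI_SITE = "CTCGAG"   # 5'-CTCGAG-3'


def gc_fraction(seq: str) -> float:
    """Fraction of C and G nucleotides in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def check_primer(seq: str, min_len: int = 10, max_len: int = 40,
                 min_gc: float = 0.40) -> bool:
    """Return True if the primer meets the length and C+G criteria."""
    return min_len <= len(seq) <= max_len and gc_fraction(seq) >= min_gc


def add_restriction_site(primer: str, site: str) -> str:
    """Place a restriction site on the 5' end of a primer (written 5' to 3')."""
    return site.upper() + primer.upper()
```

For instance, check_primer("ATGCGTACCGGATTCAGC") accepts an 18-nucleotide
primer in which 10 of the 18 nucleotides are C or G, and
add_restriction_site("ATGC", BAMHI_SITE) gives "GGATCCATGC". In practice,
extra bases are often added 5' of the site to aid digestion; that refinement is
not described in the text and is omitted here.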

Useful homologs that do not occur naturally can be designed using
known methods for identifying regions of an antigen that are likely to be
tolerant of amino acid sequence changes and/or deletions. For example,
sequences of the antigen from different species can be compared to identify
conserved sequences.

Polypeptide derivatives that are encoded by polynucleotides of the
invention include, e.g., fragments, polypeptides having large internal deletions
derived from full-length polypeptides, and fusion proteins. Polypeptide
fragments of the invention can be derived from a polypeptide having a
sequence homologous to any of the sequences of SEQ ID NOs:2-170 (even
numbers), to the extent that the fragments retain the substantial antigenicity of
the parent polypeptide (specific antigenicity). Polypeptide derivatives can also
be constructed by large internal deletions that remove a substantial part of the
parent polypeptide, while retaining specific antigenicity. Generally,
polypeptide derivatives should be at least about 12 amino acids in length to
maintain antigenicity. Advantageously, they can be at least 20 amino acids,
preferably at least 50 amino acids, more preferably at least 75 amino acids, and
most preferably at least 100 amino acids in length.

Useful polypeptide derivatives, e.g., polypeptide fragments, can be
designed using computer-assisted analysis of amino acid sequences in order to
identify sites in protein antigens having potential as surface-exposed, antigenic
regions (Hughes et al., Infect. Immun. 60(9):3497, 1992). For example, the
Laser Gene Program from DNA Star can be used to obtain hydrophilicity,
antigenic index, and intensity index plots for the polypeptides of the invention.
This program can also be used to obtain information about homologies of the
polypeptides with known protein motifs. One skilled in the art can readily use
the information provided in such plots to select peptide fragments for use as
vaccine antigens. For example, fragments spanning regions of the plots in
which the antigenic index is relatively high can be selected. One can also
select fragments spanning regions in which both the antigenic index and the
intensity plots are relatively high. Fragments containing conserved sequences,
particularly hydrophilic conserved sequences, can also be selected.
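
As a rough illustration of how a hydrophilicity plot can guide fragment
selection, the sketch below computes a sliding-window profile in Python. It
does not reproduce the program named above. It substitutes the published
Kyte-Doolittle hydropathy scale, with the sign inverted so that higher values
mean more hydrophilic. The window size, the threshold, and the function names
are assumptions, and the antigenic index and intensity index are not modeled.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydrophilicity_profile(seq: str, window: int = 7) -> list:
    """Mean hydrophilicity (negated hydropathy) for each window position."""
    seq = seq.upper()
    if window < 1 or window > len(seq):
        raise ValueError("window must be between 1 and the sequence length")
    scores = [-KYTE_DOOLITTLE[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]


def hydrophilic_windows(seq: str, window: int = 7, threshold: float = 1.0) -> list:
    """Return (start index, fragment) pairs whose mean score meets the threshold."""
    profile = hydrophilicity_profile(seq, window)
    return [
        (i, seq[i:i + window]) for i, score in enumerate(profile)
        if score >= threshold
    ]
```

Windows returned by hydrophilic_windows are only candidates. Selection of
actual vaccine fragments would also weigh the antigenic index and sequence
conservation, as described above.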

Polypeptide fragments and polypeptides having large internal
deletions can be used for revealing epitopes that are otherwise masked in the
parent polypeptide and that may be of importance for inducing a protective T
cell-dependent immune response. Deletions can also remove immunodominant
regions of high variability among strains.

It is an accepted practice in the field of immunology to use fragments
and variants of protein immunogens as vaccines, as all that is required to induce
an immune response to a protein is a small (e.g., 8 to 10 amino acids)
immunogenic region of the protein. This has been done for a number of
vaccines against pathogens other than Helicobacter. For example, short
synthetic peptides corresponding to surface-exposed antigens of pathogens such
as murine mammary tumor virus (peptide containing 11 amino acids; Dion et
al., Virology 179:474-477, 1990), Semliki Forest virus (peptide containing 16
amino acids; Snijders et al., J. Gen. Virol. 72:557-565, 1991), and canine
parvovirus (2 overlapping peptides, each containing 15 amino acids; Langeveld
et al., Vaccine 12(15):1473-1480, 1994) have been shown to be effective
vaccine antigens against their respective pathogens.

Polynucleotides encoding polypeptide fragments and polypeptides
having large internal deletions can be constructed using standard methods (see,
e.g., Ausubel et al., Current Protocols in Molecular Biology, John Wiley &
Sons Inc., 1994), for example, by PCR, including inverse PCR, by restriction
enzyme treatment of the cloned DNA molecules, or by the method of Kunkel et
al. (Proc. Natl. Acad. Sci. USA 82:448, 1985; biological material available at
Stratagene).

A polypeptide derivative can also be produced as a fusion
polypeptide that contains a polypeptide or a polypeptide derivative of the
invention fused, e.g., at the N- or C-terminal end, to any other polypeptide
(hereinafter referred to as a peptide tail). Such a product can be easily obtained
by translation of a genetic fusion, i.e., a hybrid gene. Vectors for expressing
fusion polypeptides are commercially available, and include the pMal-c2 or
pMal-p2 systems of New England Biolabs, in which the peptide tail is a
maltose binding protein, the glutathione-S-transferase system of Pharmacia, or
the His-Tag system available from Novagen. These and other expression
systems provide convenient means for further purification of polypeptides and
derivatives of the invention.

Another particular example of fusion polypeptides included in the
invention is a polypeptide or polypeptide derivative of the invention
fused to a polypeptide having adjuvant activity, such as, e.g., subunit B of
either cholera toxin or E. coli heat-labile toxin. Several approaches can be
used for producing such fusion proteins. First, the polypeptide of the invention
can be fused to the N-terminal end or, preferably, to the C-terminal end of the
polypeptide having adjuvant activity. Second, a polypeptide fragment of the
invention can be fused within the amino acid sequence of the polypeptide
having adjuvant activity. Spacer sequences can also be included, if desired.

As stated above, the polynucleotides of the invention encode 

Helicobacter polypeptides in precursor or mature form. They can also encode 

hybrid precursors containing heterologous signal peptides, which can mature 
25 into polypeptides of the invention. By "heterologous signal peptide" is meant a 

signal peptide that is not found in the naturally-occurring precursor of a 

polypeptide of the invention. 
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A polynucleotide of the invention hybridizes, preferably under 
stringent conditions, to a polynucleotide having a sequence as shown in any of
SEQ ID NOs:1-169 (odd numbers). Hybridization procedures are described, e.g.,
by Ausubel et al. (supra); Silhavy et al. (Experiments with Gene
Fusions, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New
York, 1984); and Davis et al. (A Manual for Genetic Engineering: Advanced
Bacterial Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
New York, 1980). Important parameters that can be considered for optimizing
hybridization conditions are reflected in the following formula, which
facilitates calculation of the melting temperature (Tm), which is the
temperature above which two complementary DNA strands separate from one
another (Casey et al., Nucl. Acid Res. 4:1539, 1977): Tm = 81.5 + 0.5 x (%
G+C) + 1.6 log (positive ion concentration) - 0.6 x (% formamide). Under
appropriate stringency conditions, hybridization temperature (Th) is
approximately 20 to 40°C, 20 to 25°C, or, preferably, 30 to 40°C below the
calculated Tm. Those skilled in the art will understand that optimal
temperature and salt conditions can be readily determined empirically in
preliminary experiments using conventional procedures. For example,
stringent conditions can be achieved, both for pre-hybridizing and hybridizing
incubations, (i) within 4-16 hours at 42°C, in 6 x SSC containing
50% formamide or (ii) within 4-16 hours at 65°C in an aqueous 6 x SSC
solution (1 M NaCl, 0.1 M sodium citrate (pH 7.0)). For polynucleotides
containing 30 to 600 nucleotides, the above formula is used and then is
corrected by subtracting (600/polynucleotide size in base pairs). Stringency
conditions are defined by a Th that is 5 to 10°C below Tm.
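
As a rough illustration of the arithmetic described above, the following Python sketch evaluates the Tm formula with the coefficients as printed in this text, applies the length correction for polynucleotides of 30 to 600 nucleotides, and derives a hybridization temperature a chosen number of degrees below Tm. The function names, the use of a base-10 logarithm, and the expression of ion concentration in mol/l are assumptions made for illustration and are not stated in the text.

```python
import math


def melting_temperature(gc_percent, ion_molar, formamide_percent=0.0, length_bp=None):
    """Return Tm in deg C using the formula quoted in the text.

    Assumptions (not stated in the text): the logarithm is base 10 and the
    positive ion concentration is given in mol/l.
    """
    tm = (81.5
          + 0.5 * gc_percent
          + 1.6 * math.log10(ion_molar)
          - 0.6 * formamide_percent)
    # Length correction described for polynucleotides of 30 to 600 nucleotides.
    if length_bp is not None and 30 <= length_bp <= 600:
        tm -= 600.0 / length_bp
    return tm


def hybridization_temperature(tm, offset_below_tm):
    """Return a Th that lies the given number of degrees below Tm."""
    return tm - offset_below_tm


if __name__ == "__main__":
    # Hypothetical 300 bp polynucleotide, 50% G+C, 1 M positive ions, 50% formamide.
    tm = melting_temperature(50, 1.0, formamide_percent=50, length_bp=300)
    print(round(tm, 1))
    print(round(hybridization_temperature(tm, 10), 1))
```

Such a calculation only gives a starting point; as noted above, optimal temperature and salt conditions would still be determined empirically.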

Hybridization conditions with oligonucleotides shorter than 20-30
bases do not precisely follow the rules set forth above. In such cases, the
formula for calculating the Tm is as follows: Tm = 4 x (G+C) + 2 x (A+T). For
example, an 18 nucleotide fragment of 50% G+C would have an approximate
Tm of 54°C.
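
The short-oligonucleotide rule above can be checked with a few lines of Python. This is a minimal sketch; the function name and the example sequence are hypothetical and serve only to reproduce the 18-nucleotide, 50% G+C case mentioned in the text.

```python
def short_oligo_tm(sequence):
    """Return Tm in deg C as 4 x (G+C) + 2 x (A+T) for a short DNA oligonucleotide."""
    seq = sequence.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at != len(seq):
        raise ValueError("sequence must contain only A, C, G, and T")
    return 4 * gc + 2 * at


if __name__ == "__main__":
    # Hypothetical 18-mer with 9 G/C and 9 A/T residues (50% G+C).
    print(short_oligo_tm("ATGCATGCATGCATGCAG"))
```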

A polynucleotide molecule of the invention, containing RNA, DNA,
or modifications or combinations thereof, can have various applications. For
example, a polynucleotide molecule can be used (i) in a process for producing
the encoded polypeptide in a recombinant host system, (ii) in the construction
of vaccine vectors, such as poxviruses, which are further used in methods and
compositions for preventing and/or treating Helicobacter infection, (iii) as a
vaccine agent, in a naked form or formulated with a delivery vehicle, and (iv)
in the construction of attenuated Helicobacter strains that can over-express a
polynucleotide of the invention or express it in a non-toxic, mutated form.

According to a second aspect of the invention, there is therefore
provided (i) an expression cassette containing a polynucleotide molecule of the
invention placed under the control of elements (e.g., a promoter) required for
expression; (ii) an expression vector containing an expression cassette of the
invention; (iii) a procaryotic or eucaryotic cell transformed or transfected with
an expression cassette and/or vector of the invention, as well as (iv) a process
for producing a polypeptide or polypeptide derivative encoded by a
polynucleotide of the invention, which involves culturing a procaryotic or
eucaryotic cell transformed or transfected with an expression cassette and/or
vector of the invention, under conditions that allow expression of the
polynucleotide molecule of the invention, and recovering the encoded
polypeptide or polypeptide derivative from the cell culture.

A recombinant expression system can be selected from procaryotic
and eucaryotic hosts. Eucaryotic hosts include, for example, yeast cells (e.g.,
Saccharomyces cerevisiae or Pichia pastoris), mammalian cells (e.g., COS1,
NIH3T3, or JEG3 cells), arthropod cells (e.g., Spodoptera frugiperda (SF9)
cells), and plant cells. Preferably, a procaryotic host such as E. coli is used.
Bacterial and eucaryotic cells are available from a number of different sources
that are known to those skilled in the art, e.g., the American Type Culture
Collection (ATCC; Rockville, Maryland).

The choice of the expression cassette will depend on the host system
selected, as well as the features desired for the expressed polypeptide. For
example, it may be useful to produce a polypeptide of the invention in a
particular lipidated form or any other form. Typically, an expression cassette
includes a constitutive or inducible promoter that is functional in the selected
host system; a ribosome binding site; a start codon (ATG); if necessary, a
region encoding a signal peptide, e.g., a lipidation signal peptide; a
polynucleotide molecule of the invention; a stop codon; and, optionally, a 3'
terminal region (translation and/or transcription terminator). The signal
peptide-encoding region is adjacent to the polynucleotide of the invention and
is placed in the proper reading frame. The signal peptide-encoding region can
be homologous or heterologous to the polynucleotide molecule encoding the
mature polypeptide and it can be specific to the secretion apparatus of the host
used for expression. The open reading frame constituted by the polynucleotide
molecule of the invention, alone or together with the signal peptide, is placed
under the control of the promoter so that transcription and translation occur in
the host system. Promoters and signal peptide-encoding regions are widely
known and available to those skilled in the art and include, for example, the
promoter of Salmonella typhimurium (and derivatives) that is inducible by
arabinose (promoter araB) and is functional in Gram-negative bacteria such as
E. coli (U.S. Patent No. 5,028,530; Cagnon et al., Protein Engineering
4(7):843, 1991); the promoter of the bacteriophage T7 RNA polymerase gene,






WO 98/21225 . PCT/US97/21353 - 


which is functional in a number of E. coli strains expressing T7 polymerase
(U.S. Patent No. 4,952,496); the OspA lipidation signal peptide; and RlpB
lipidation signal peptide (Takase et al., J. Bact. 169:5692, 1987).

The expression cassette is typically part of an expression vector,
which is selected for its ability to replicate in the chosen expression system.
Expression vectors (e.g., plasmids or viral vectors) can be chosen from, for
example, those described in Pouwels et al. (Cloning Vectors: A Laboratory
Manual, 1985, Supp. 1987), and can be purchased from various commercial
sources. Methods for transforming or transfecting host cells with expression
vectors are well known in the art and will depend on the host system selected,
as described in Ausubel et al. (supra).

Upon expression, a recombinant polypeptide of the invention (or a
polypeptide derivative) is produced and remains in the intracellular
compartment, is secreted/excreted in the extracellular medium or in the
periplasmic space, or is embedded in the cellular membrane. The polypeptide
can then be recovered in a substantially purified form from the cell extract or
from the supernatant after centrifugation of the cell culture. Typically, the
recombinant polypeptide can be purified by antibody-based affinity purification
or by any other method known in the art, such as by genetic fusion to a small
affinity-binding domain. Antibody-based affinity purification methods are also
available for purifying a polypeptide of the invention extracted from a
Helicobacter strain. Antibodies useful for immunoaffinity purification of the
polypeptides of the invention can be obtained using methods described below.

Polynucleotides of the invention can also be used in DNA
vaccination methods, using either a viral or bacterial host as gene delivery
vehicle (live vaccine vector) or administering the gene in a free form, e.g.,
inserted into a plasmid. Therapeutic or prophylactic efficacy of a
polynucleotide of the invention can be evaluated as is described below.

Accordingly, in a third aspect of the invention, there is provided (i) a
vaccine vector such as a poxvirus, containing a polynucleotide molecule of the
invention placed under the control of elements required for expression; (ii) a
composition of matter containing a vaccine vector of the invention, together
with a diluent or carrier; (iii) a pharmaceutical composition containing a
therapeutically or prophylactically effective amount of a vaccine vector of the
invention; (iv) a method for inducing an immune response against Helicobacter
in a mammal (e.g., a human; alternatively, the method can be used in veterinary
applications for treating or preventing Helicobacter infection of animals, e.g.,
cats or birds), which involves administering to the mammal an
immunogenically effective amount of a vaccine vector of the invention to elicit
an immune response, e.g., a protective or therapeutic immune response to
Helicobacter; and (v) a method for preventing and/or treating a Helicobacter
(e.g., H. pylori, H. felis, H. mustelae, or H. heilmanii) infection, which involves
administering a prophylactic or therapeutic amount of a vaccine vector of the
invention to an individual in need. Additionally, the third aspect of the
invention encompasses the use of a vaccine vector of the invention in the
preparation of a medicament for preventing and/or treating Helicobacter
infection.

A vaccine vector of the invention can express one or several
polypeptides or derivatives of the invention, as well as at least one additional
Helicobacter antigen such as a urease apoenzyme or a subunit, fragment,
homolog, mutant, or derivative thereof. In addition, it can express a cytokine,
such as interleukin-2 (IL-2) or interleukin-12 (IL-12), that enhances the
immune response. Thus, a vaccine vector can include an additional
polynucleotide molecule encoding, e.g., urease subunit A, B, or both, or a
cytokine, placed under the control of elements required for expression in a
mammalian cell.

Alternatively, a composition of the invention can include several
vaccine vectors, each of which is capable of expressing a polypeptide or
derivative of the invention. A composition can also contain a vaccine vector
capable of expressing an additional Helicobacter antigen, such as urease
apoenzyme, a subunit, fragment, homolog, mutant, or derivative thereof, or a
cytokine such as IL-2 or IL-12.

In vaccination methods for treating or preventing infection in a
mammal, a vaccine vector of the invention can be administered by any
conventional route in use in the vaccine field, for example, to a mucosal (e.g.,
ocular, intranasal, oral, gastric, pulmonary, intestinal, rectal, vaginal, or urinary
tract) surface or via a parenteral (e.g., subcutaneous, intradermal,
intramuscular, intravenous, or intraperitoneal) route, or a combination thereof.
Preferred routes depend upon the choice of the vaccine vector. The
administration can be achieved in a single dose or repeated at intervals. The
appropriate dosage depends on various parameters that are understood by those
skilled in the art, such as the nature of the vaccine vector itself, the route of
administration, and the condition of the mammal to be vaccinated (e.g., the
weight, age, and general health of the mammal).

Live vaccine vectors that can be used in the invention include viral
vectors, such as adenoviruses and poxviruses, as well as bacterial vectors, e.g.,
Shigella, Salmonella, Vibrio cholerae, Lactobacillus, Bacille bilié de
Calmette-Guérin (BCG), and Streptococcus. An example of an adenovirus vector, as
well as a method for constructing an adenovirus vector capable of expressing a
polynucleotide molecule of the invention, is described in U.S. Patent No.
4,920,209. Poxvirus vectors that can be used in the invention include, e.g.,
vaccinia and canary pox viruses, which are described in U.S. Patent No.
4,722,848 and U.S. Patent No. 5,364,773, respectively (also see, e.g., Tartaglia
et al., Virology 188:217, 1992, for a description of a vaccinia virus vector, and
Taylor et al., Vaccine 13:539, 1995, for a description of a canary poxvirus
vector). Poxvirus vectors capable of expressing a polynucleotide of the
invention can be obtained by homologous recombination, as described in Kieny
et al. (Nature 312:163, 1984) so that the polynucleotide of the invention is
inserted into the viral genome under appropriate conditions for expression in
mammalian cells. Generally, the dose of viral vector vaccine, for therapeutic
or prophylactic use, can be from about 1x10^4 to about 1x10^11, advantageously
from about 1x10^7 to about 1x10^10, or, preferably, from about 1x10^7 to about
1x10^9 plaque-forming units per kilogram. Preferably, viral vectors are
administered parenterally, for example, in 3 doses that are 4 weeks apart.
Those skilled in the art will recognize that it is preferable to avoid adding a
chemical adjuvant to a composition containing a viral vector of the invention,
thereby minimizing the immune response to the viral vector itself.

Non-toxicogenic Vibrio cholerae mutant strains that can be used in
live oral vaccines are described by Mekalanos et al. (Nature 306:551, 1983)
and in U.S. Patent No. 4,882,278 (strain in which a substantial amount of the
coding sequence of each of the two ctxA alleles has been deleted so that no
functional cholera toxin is produced); WO 92/11354 (strain in which the irgA
locus is inactivated by mutation; this mutation can be combined in a single
strain with ctxA mutations); and WO 94/1533 (deletion mutant lacking
functional ctxA and attRS1 DNA sequences). These strains can be genetically
engineered to express heterologous antigens, as described in WO 94/19482.
An effective vaccine dose of a V. cholerae strain capable of expressing a
polypeptide or polypeptide derivative encoded by a polynucleotide molecule of
the invention can contain, e.g., about 1x10^5 to about 1x10^9, preferably about
1x10^6 to about 1x10^8, viable bacteria in an appropriate volume for the selected
route of administration. Preferred routes of administration include all mucosal
routes, but, most preferably, these vectors are administered intranasally or
orally.

Attenuated Salmonella typhimurium strains, genetically engineered
for recombinant expression of heterologous antigens, and their use as oral
vaccines, are described by Nakayama et al. (Bio/Technology 6:693, 1988) and
in WO 92/11361. Preferred routes of administration for these vectors include
all mucosal routes. Most preferably, the vectors are administered intranasally
or orally.

Other bacterial strains useful as vaccine vectors are described by
High et al. (EMBO J. 11:1991, 1992) and Sizemore et al. (Science 270:299,
1995; Shigella flexneri); Medaglini et al. (Proc. Natl. Acad. Sci. USA 92:6868,
1995; Streptococcus gordonii); Flynn (Cell. Mol. Biol. 40 (suppl. I):31, 1994),
and in WO 88/6626, WO 90/0594, WO 91/13157, WO 92/1796, and WO
92/21376 (Bacille Calmette-Guérin). In bacterial vectors, a polynucleotide of
the invention can be inserted into the bacterial genome or it can remain in a free
state, for example, carried on a plasmid.

An adjuvant can also be added to a composition containing a 
bacterial vector vaccine. A number of adjuvants that can be used are known to 
those skilled in the art. For example, preferred adjuvants can be selected from 
the list provided below. 

According to a fourth aspect of the invention, there is also provided
(i) a composition of matter containing a polynucleotide of the invention,
together with a diluent or carrier; (ii) a pharmaceutical composition containing
a therapeutically or prophylactically effective amount of a polynucleotide of the
invention; (iii) a method for inducing an immune response against Helicobacter
in a mammal by administering to the mammal an immunogenically effective
amount of a polynucleotide of the invention to elicit an immune response, e.g.,
a protective immune response to Helicobacter; and (iv) a method for
preventing and/or treating a Helicobacter (e.g., H. pylori, H. felis, H. mustelae,
or H. heilmanii) infection, by administering a prophylactic or therapeutic
amount of a polynucleotide of the invention to an individual in need of such
treatment. Additionally, the fourth aspect of the invention encompasses the use
of a polynucleotide of the invention in the preparation of a medicament for
preventing and/or treating Helicobacter infection. The fourth aspect of the
invention preferably includes the use of a polynucleotide molecule placed
under conditions for expression in a mammalian cell, e.g., in a plasmid that is
unable to replicate in mammalian cells and to substantially integrate into a
mammalian genome.

Polynucleotides (for example, DNA or RNA molecules) of the
invention can also be administered as such to a mammal as a vaccine. When a
DNA molecule of the invention is used, it can be in the form of a plasmid that
is unable to replicate in a mammalian cell and unable to integrate into the
mammalian genome. Typically, a DNA molecule is placed under the control of
a promoter suitable for expression in a mammalian cell. The promoter can
function ubiquitously or tissue-specifically. Examples of non-tissue specific
promoters include the early Cytomegalovirus (CMV) promoter (U.S. Patent
No. 4,168,062) and the Rous Sarcoma Virus promoter (Norton et al., Molec.
Cell Biol. 5:281, 1985). The desmin promoter (Li et al., Gene 78:243, 1989; Li
et al., J. Biol. Chem. 266:6562, 1991; Li et al., J. Biol. Chem. 268:10403,
1993) is tissue-specific and drives expression in muscle cells. More generally,
useful promoters and vectors are described, e.g., in WO 94/21797 and by
Hartikka et al. (Human Gene Therapy 7:1205, 1996).

For DNA/RNA vaccination, the polynucleotide of the invention can
encode a precursor or a mature form of a polypeptide of the invention. When it
encodes a precursor form, the precursor sequence can be homologous or
heterologous. In the latter case, a eucaryotic leader sequence can be used, such
as the leader sequence of the tissue-type plasminogen activator (tPA).

A composition of the invention can contain one or several
polynucleotides of the invention. It can also contain at least one additional
polynucleotide encoding another Helicobacter antigen, such as urease subunit
A, B, or both, or a fragment, derivative, mutant, or analog thereof. A
polynucleotide encoding a cytokine, such as interleukin-2 (IL-2) or
interleukin-12 (IL-12), can also be added to the composition so that the immune
response is enhanced. These additional polynucleotides are placed under
appropriate control for expression. Advantageously, DNA molecules of the
invention and/or additional DNA molecules to be included in the same
composition are carried in the same plasmid.

Standard methods can be used in the preparation of therapeutic
polynucleotides of the invention. For example, a polynucleotide can be used in
a naked form, free of any delivery vehicles, such as anionic liposomes, cationic
lipids, microparticles, e.g., gold microparticles, precipitating agents, e.g.,
calcium phosphate, or any other transfection-facilitating agent. In this case, the
polynucleotide can be simply diluted in a physiologically acceptable solution,
such as sterile saline or sterile buffered saline, with or without a carrier. When
present, the carrier preferably is isotonic, hypotonic, or weakly hypertonic, and
has a relatively low ionic strength, such as provided by a sucrose solution, e.g.,
a solution containing 20% sucrose.

Alternatively, a polynucleotide can be associated with agents that
assist in cellular uptake. It can be, e.g., (i) complemented with a chemical
agent that modifies cellular permeability, such as bupivacaine (see, e.g.,
WO 94/16737), (ii) encapsulated into liposomes, or (iii) associated with
cationic lipids or silica, gold, or tungsten microparticles.

Anionic and neutral liposomes are well-known in the art (see, e.g.,
Liposomes: A Practical Approach, RRC New, ed., IRL Press, 1990, for a
detailed description of methods for making liposomes) and are useful for
delivering a large range of products, including polynucleotides.

Cationic lipids can also be used for gene delivery. Such lipids
include, for example, Lipofectin™, which is also known as DOTMA
(N-[1-(2,3-dioleyloxy)propyl]-N,N,N-trimethylammonium chloride), DOTAP
(1,2-bis(oleyloxy)-3-(trimethylammonio)propane), DDAB
(dimethyldioctadecylammonium bromide), DOGS (dioctadecylamidologlycyl
spermine), and cholesterol derivatives. A description of these cationic lipids
can be found in EP 187,702, WO 90/11092, U.S. Patent No. 5,283,185,
WO 91/15501, WO 95/26356, and U.S. Patent No. 5,527,928. Cationic lipids for
gene delivery are preferably used in association with a neutral lipid, such as
DOPE (dioleyl phosphatidylethanolamine; WO 90/11092). Other
transfection-facilitating compounds can be added to a formulation containing
cationic liposomes. A number of them are described in, e.g., WO 93/18759,
WO 93/19768, WO 94/25608, and WO 95/2397. They include, e.g., spermine
derivatives useful for facilitating the transport of DNA through the nuclear
membrane (see, for example, WO 93/18759) and membrane-permeabilizing
compounds such as GALA, Gramicidine S, and cationic bile salts (see, for
example, WO 93/19768).

Gold or tungsten microparticles can also be used for gene delivery,
as described in WO 91/359, WO 93/17706, and by Tang et al. (Nature 356:152,
1992). In this case, the microparticle-coated polynucleotides can be injected
via intradermal or intraepidermal routes using a needleless injection device
("gene gun"), such as those described in U.S. Patent No. 4,945,050, U.S. Patent
No. 5,015,580, and WO 94/24263.

The amount of DNA to be used in a vaccine depends, e.g., on the
strength of the promoter used in the DNA construct, the immunogenicity of the
expressed gene product, the condition of the mammal intended for
administration (e.g., the weight, age, and general health of the mammal), the
mode of administration, and the type of formulation. In general, a
therapeutically or prophylactically effective dose from about 1 µg to about
1 mg, preferably, from about 10 µg to about 800 µg, and, more preferably,
from about 25 µg to about 250 µg, can be administered to a human adult. The
administration can be achieved in a single dose or repeated at intervals.

The route of administration can be any conventional route used in the
vaccine field. As general guidance, a polynucleotide of the invention can be
administered via a mucosal surface, e.g., an ocular, intranasal, pulmonary, oral,
intestinal, rectal, vaginal, or urinary tract surface, or via a parenteral route, e.g.,
by an intravenous, subcutaneous, intraperitoneal, intradermal, intraepidermal,
or intramuscular route. The choice of administration route will depend on, e.g.,
the formulation that is selected. A polynucleotide formulated in association
with bupivacaine is advantageously administered into muscle. When a neutral
or anionic liposome or a cationic lipid, such as DOTMA, is used, the
formulation can be advantageously administered via intravenous, intranasal (for
example, by aerosolization), intramuscular, intradermal, and subcutaneous
routes. A polynucleotide in a naked form can advantageously be administered
via the intramuscular, intradermal, or subcutaneous routes. Although not
absolutely required, such a composition can also contain an adjuvant. A
systemic adjuvant that does not require concomitant administration in order to
exhibit an adjuvant effect is preferable.

The sequence information provided in the present application enables
the design of specific nucleotide probes and primers that can be used in
diagnostic methods. Accordingly, in a fifth aspect of the invention, there is
provided a nucleotide probe or primer having a sequence found in, or derived
by degeneracy of the genetic code from, a sequence shown in any of SEQ ID
NOs:1-169 (odd numbers).

The term "probe" as used in the present application refers to a DNA
(preferably single stranded) or RNA molecule (or modifications or
combinations thereof) that hybridizes under the stringent conditions, as defined
above, to a polynucleotide molecule having a sequence homologous to any of
those shown in SEQ ID NOs:1-169 (odd numbers), or to a complementary or
anti-sense sequence of any of those shown in SEQ ID NOs:1-169 (odd
numbers). Generally, probes are significantly shorter than the full-length
sequences shown in SEQ ID NOs:1-169 (odd numbers). For example, they can
contain from about 5 to about 100, preferably from about 10 to about 80,
nucleotides. In particular, probes have sequences that are at least 75%,
preferably at least 85%, more preferably 95% homologous to a portion of a
sequence as shown in any of SEQ ID NOs:1-169 (odd numbers) or a sequence
complementary to any of such sequences.
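
The homology thresholds above can be illustrated with a short Python sketch that slides a probe along a target sequence and reports the best ungapped percent identity. Treating "homologous" as simple position-by-position identity without gaps is a simplifying assumption, and the function names and example sequences are hypothetical; a real comparison would normally rely on an established alignment tool.

```python
def best_ungapped_identity(probe, target):
    """Return the highest percent identity of probe against any
    same-length window of target (ungapped comparison only)."""
    probe, target = probe.upper(), target.upper()
    if not probe or len(probe) > len(target):
        raise ValueError("probe must be non-empty and no longer than target")
    best = 0.0
    for start in range(len(target) - len(probe) + 1):
        window = target[start:start + len(probe)]
        matches = sum(p == t for p, t in zip(probe, window))
        best = max(best, 100.0 * matches / len(probe))
    return best


def homology_tier(identity_percent):
    """Map a percent identity onto the thresholds named in the text."""
    if identity_percent >= 95:
        return "more preferred (95% or more)"
    if identity_percent >= 85:
        return "preferred (at least 85%)"
    if identity_percent >= 75:
        return "acceptable (at least 75%)"
    return "below 75%"


if __name__ == "__main__":
    target = "TTTACGTACGTACGGG"   # hypothetical portion of a target sequence
    probe = "ACGTACGTAA"          # hypothetical 10-nucleotide probe
    identity = best_ungapped_identity(probe, target)
    print(identity, homology_tier(identity))
```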

Probes can contain modified bases, such as inosine,
methyl-5-deoxycytidine, deoxyuridine, dimethylamino-5-deoxyuridine, or
diamino-2,6-purine. Sugar or phosphate residues can also be modified or
substituted. For example, a deoxyribose residue can be replaced by a polyamide
(Nielsen et al.,




Science 254:1497, 1991) and phosphate residues can be replaced by ester
groups, such as diphosphate, alkyl, arylphosphonate, and phosphorothioate
esters. In addition, the 2'-hydroxyl group on ribonucleotides can be modified
by addition of, e.g., alkyl groups.

Probes of the invention can be used in diagnostic tests or as capture
or detection probes. Such capture probes can be immobilized on solid supports,
directly or indirectly, by covalent means or by passive adsorption. A detection
probe can be labeled by a detectable label, for example, a label selected from
radioactive isotopes; enzymes, such as peroxidase and alkaline phosphatase;
enzymes that are able to hydrolyze a chromogenic, fluorogenic, or luminescent
substrate; compounds that are chromogenic, fluorogenic, or luminescent;
nucleotide base analogs; and biotin.

Probes of the invention can be used in any conventional
hybridization method, such as in dot blot methods (Maniatis et al., Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, New York, 1982), Southern blot methods (Southern, J. Mol.
Biol. 98:503, 1975), northern blot methods (identical to Southern blot with the
exception that RNA is used as a target), or a sandwich method (Dunn et al.,
Cell 12:23, 1977). As is known in the art, the latter technique involves the use
of a specific capture probe and a specific detection probe that have nucleotide
sequences that are at least partially different from each other.

Primers used in the invention usually contain about 10 to
40 nucleotides and are used to initiate enzymatic polymerization of DNA in an
amplification process (e.g., PCR), an elongation process, or a reverse
transcription method. In a diagnostic method involving PCR, the primers can
be labeled.

Thus, the invention also encompasses (i) a reagent containing a
probe of the invention for detecting and/or identifying the presence of
Helicobacter in a biological material; (ii) a method for detecting and/or
identifying the presence of Helicobacter in a biological material, in which (a) a
sample is recovered or derived from the biological material, (b) DNA or RNA
is extracted from the material and denatured, and (c) the sample is exposed to a
probe of the invention, for example, a capture probe, a detection probe, or both,
under stringent hybridization conditions, so that hybridization is detected; and
(iii) a method for detecting and/or identifying the presence of Helicobacter in a
biological material, in which (a) a sample is recovered or derived from the
biological material, (b) DNA is extracted therefrom, (c) the extracted DNA is
contacted with at least one, or, preferably two, primers of the invention, and
amplified by the polymerase chain reaction, and (d) an amplified DNA
molecule is produced.

As mentioned above, polypeptides that can be produced by
expression of the polynucleotides of the invention can be used as vaccine
antigens. Accordingly, a sixth aspect of the invention features a substantially
purified polypeptide or polypeptide derivative having an amino acid sequence
encoded by a polynucleotide of the invention.

A "substantially purified polypeptide" is defined as a polypeptide
that is separated from the environment in which it naturally occurs and/or a
polypeptide that is free of most of the other polypeptides that are present in the
environment in which it was synthesized. The polypeptides of the invention
can be purified from a natural source, such as a Helicobacter strain, or can be
produced using recombinant methods.

Homologous polypeptides or polypeptide derivatives encoded by
polynucleotides of the invention can be screened for specific antigenicity by
testing cross-reactivity with an antiserum raised against a polypeptide having
an amino acid sequence as shown in any of SEQ ID NOs:2-170 (even
numbers). Briefly, a monospecific hyperimmune antiserum can be raised
against a purified reference polypeptide as such or as a fusion polypeptide, for
example, an expression product of MBP, GST, or His-tag systems, or a
synthetic peptide predicted to be antigenic. The homologous polypeptide or
derivative that is screened for specific antigenicity can be produced as such or
as a fusion polypeptide. In the latter case, and if the antiserum is also raised
against a fusion polypeptide, two different fusion systems are employed.
Specific antigenicity can be determined using a number of methods, including
Western blot (Towbin et al., Proc. Natl. Acad. Sci. USA 76:4350, 1979), dot
blot, and ELISA methods, as described below.

In a Western blot assay, the product to be screened, either as a
purified preparation or a total E. coli extract, is fractionated by SDS-PAGE, as
described, for example, by Laemmli (Nature 227:680, 1970). After being
transferred to a filter, such as a nitrocellulose membrane, the material is
incubated with the monospecific hyperimmune antiserum, which is diluted in a
range of dilutions from about 1:50 to about 1:5,000, preferably from about
1:100 to about 1:500. Specific antigenicity is shown once a band
corresponding to the product exhibits reactivity at any of the dilutions in the
range.

In an ELISA assay, the product to be screened can be used as the 
coating antigen. A purified preparation is preferred, but a whole cell extract 
can also be used. Briefly, about 100 µl of a preparation of about 10 µg
protein/ml is distributed into wells of a 96-well ELISA plate. The plate is
incubated for about 2 hours at 37°C, then overnight at 4°C. The plate is
washed with phosphate-buffered saline (PBS) containing 0.05% Tween 20
(PBS/Tween buffer) and the wells are saturated with 250 µl PBS containing
1% bovine serum albumin (BSA), to prevent non-specific antibody binding.
After 1 hour of incubation at 37°C, the plate is washed with PBS/Tween buffer.
The antiserum is serially diluted in PBS/Tween buffer containing 0.5% BSA,
and 100 µl of each dilution is added to each well. The plate is incubated for
90 minutes at 37°C, washed, and evaluated using standard methods. For
example, a goat anti-rabbit peroxidase conjugate can be added to the wells
when the specific antibodies used were raised in rabbits. Incubation is carried
out for about 90 minutes at 37°C and the plate is washed. The reaction is
developed with the appropriate substrate and the reaction is measured by
colorimetry (absorbance measured spectrophotometrically). Under these
experimental conditions, a positive reaction is shown once an O.D. value of 1.0
is detected with a dilution of at least about 1:50, preferably of at least about
1:500.
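
The ELISA positivity criterion above can be sketched in code. This is only an illustrative reading of the criterion, assuming that a reaction counts as positive when some dilution factor of at least 50 still gives an O.D. of at least 1.0. The function names and the plate readings are hypothetical and are not part of the described method.

```python
def elisa_endpoint(readings, od_cutoff=1.0):
    """Return the highest dilution factor whose O.D. reaches the cutoff, or None.

    readings maps a dilution factor (50 means 1:50) to a measured O.D.
    """
    passing = [dilution for dilution, od in readings.items() if od >= od_cutoff]
    return max(passing) if passing else None


def is_positive(readings, min_dilution=50, od_cutoff=1.0):
    """Positive if the cutoff O.D. is reached at a dilution of at least 1:min_dilution."""
    endpoint = elisa_endpoint(readings, od_cutoff)
    return endpoint is not None and endpoint >= min_dilution


# Hypothetical readings for one antiserum: dilution factor -> O.D.
example_readings = {50: 1.8, 100: 1.5, 500: 1.1, 1000: 0.7, 5000: 0.2}
```

Calling `is_positive(example_readings, min_dilution=500)` applies the preferred, stricter threshold of 1:500.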

In a dot blot assay, a purified product is preferred, although a whole
cell extract can be used. Briefly, a solution of the product at a concentration of
about 100 µg/ml is serially diluted two-fold with 50 mM Tris-HCl (pH 7.5).
One hundred µl of each dilution is applied to a filter, such as a 0.45 µm
nitrocellulose membrane, set in a 96-well dot blot apparatus (Biorad). The
buffer is removed by applying vacuum to the system. Wells are washed by
addition of 50 mM Tris-HCl (pH 7.5) and the membrane is air-dried. The
membrane is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5), 0.15 M
NaCl, 10 g/l skim milk) and incubated with an antiserum diluted from about
1:50 to about 1:5000, preferably about 1:500. The reaction is detected using
standard methods. For example, a goat anti-rabbit peroxidase conjugate can be
added to the wells when rabbit antibodies are used. Incubation is carried out
for about 90 minutes at 37°C and the blot is washed. The reaction is developed
with the appropriate substrate and stopped. The reaction is then measured
visually by the appearance of a colored spot, e.g., by colorimetry. Under these
experimental conditions, a positive reaction is associated with detection of a
colored spot for reactions carried out with a dilution of at least about 1:50,
preferably, of at least about 1:500. Therapeutic or prophylactic efficacy of a
polypeptide or polypeptide derivative of the invention can be evaluated as
described below.
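
The two-fold antigen dilution series used in the dot blot assay can be worked through as follows. This is a minimal sketch: the starting concentration and spot volume come from the text, while the number of steps, the choice to spot the undiluted solution first, and the function name are assumptions made for illustration.

```python
def twofold_series(start_ug_per_ml=100.0, steps=8, spot_volume_ul=100.0):
    """Return (concentration in ug/ml, protein mass per spot in ug) for each dilution.

    The first entry is the starting solution; each later entry is half the previous one.
    """
    series = []
    conc = start_ug_per_ml
    for _ in range(steps):
        mass_ug = conc * spot_volume_ul / 1000.0  # ug/ml times ml applied
        series.append((conc, mass_ug))
        conc /= 2.0
    return series


for conc, mass in twofold_series():
    print(f"{conc:8.3f} ug/ml -> {mass:7.4f} ug per spot")
```

Under these assumptions the first spot carries about 10 µg of protein and each subsequent spot carries half as much.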

According to a seventh aspect of the invention, there is provided (i) a 
composition of matter containing a polypeptide of the invention, together with
a diluent or carrier; (ii) a pharmaceutical composition containing a
therapeutically or prophylactically effective amount of a polypeptide of the
invention; (iii) a method for inducing an immune response against Helicobacter
in a mammal by administering to the mammal an immunogenically effective
amount of a polypeptide of the invention to elicit an immune response, e.g., a
protective immune response to Helicobacter; and (iv) a method for preventing
and/or treating a Helicobacter (e.g., H. pylori, H. felis, H. mustelae, or H.
heilmanii) infection, by administering a prophylactic or therapeutic amount of a
polypeptide of the invention to an individual in need of such treatment.
Additionally, this aspect of the invention includes the use of a polypeptide of
the invention in the preparation of a medicament for preventing and/or treating
Helicobacter infection.

The immunogenic compositions of the invention can be administered 
by any conventional route in use in the vaccine field, for example, to a mucosal 
(e.g., ocular, intranasal, pulmonary, oral, gastric, intestinal, rectal, vaginal, or
urinary tract) surface or via a parenteral (e.g., subcutaneous, intradermal,
intramuscular, intravenous, or intraperitoneal) route. The choice of the
administration route depends upon a number of parameters, such as the
adjuvant used. For example, if a mucosal adjuvant is used, the intranasal or
oral route will be preferred, and if a lipid formulation or an aluminum
compound is used, a parenteral route will be preferred. In the latter case, the
subcutaneous or intramuscular route is most preferred. The choice of
administration route can also depend upon the nature of the vaccine agent. For
example, a polypeptide of the invention fused to CTB or to LTB will be best
administered to a mucosal surface.

A composition of the invention can contain one or several 
polypeptides or derivatives of the invention. It can also contain at least one
additional Helicobacter antigen, such as the urease apoenzyme, or a subunit,
fragment, homolog, mutant, or derivative thereof.

For use in a composition of the invention, a polypeptide or 
polypeptide derivative can be formulated into or with liposomes, such as 
neutral or anionic liposomes, microspheres, ISCOMS, or virus-like particles
(VLPs), to facilitate delivery and/or enhance the immune response. These
compounds are readily available to those skilled in the art; for example, see
Liposomes: A Practical Approach (supra). Adjuvants other than liposomes can
also be used in the invention and are well known in the art (see, for example, 
the list provided below). 

Administration can be achieved in a single dose or repeated as
necessary at appropriate intervals that can be determined by those skilled in the
art. For example, a priming dose can be followed by three booster doses at
weekly or monthly intervals. An appropriate dose depends on various
parameters, including the nature of the recipient (e.g., whether the recipient is
an adult or an infant), the particular vaccine antigen, the route and frequency of
administration, the presence/absence or type of adjuvant, and the desired effect
(e.g., protection and/or treatment), and can be readily determined by one skilled
in the art. In general, a vaccine antigen of the invention can be administered
mucosally in an amount ranging from about 10 µg to about 500 mg, preferably
from about 1 mg to about 200 mg. For a parenteral route of administration, the
dose usually should not exceed about 1 mg, and is, preferably, about 100 µg.

When used as components of a vaccine, the polynucleotides and
polypeptides of the invention can be used sequentially as part of a multi-step
immunization process. For example, a mammal can be initially primed with a
vaccine vector of the invention, such as a pox virus, e.g., via a parenteral route,
and then boosted twice with a polypeptide encoded by the vaccine vector, e.g.,
via the mucosal route. In another example, liposomes associated with a
polypeptide or polypeptide derivative of the invention can be used for priming,
with boosting being carried out mucosally using a soluble polypeptide or
polypeptide derivative of the invention, in combination with a mucosal
adjuvant (e.g., LT).

Polypeptides and polypeptide derivatives of the invention can also be
used as diagnostic reagents for detecting the presence of anti-Helicobacter
antibodies, e.g., in blood samples. Such polypeptides can be about 5 to about
80, preferably, about 10 to about 50, amino acids in length and can be labeled
or unlabeled, depending upon the diagnostic method. Diagnostic methods
involving such a reagent are described below.

Upon expression of a polynucleotide molecule of the invention, a 
polypeptide or polypeptide derivative is produced and can be purified using 
known methods. For example, the polypeptide or polypeptide derivative can be 
produced as a fusion protein containing a fused tail that facilitates purification.
The fusion product can be used to immunize a small mammal, e.g., a mouse or
a rabbit, in order to raise monospecific antibodies against the polypeptide or
polypeptide derivative. The eighth aspect of the invention thus provides a
monospecific antibody that binds to a polypeptide or polypeptide derivative of
the invention. 

By "monospecific antibody" is meant an antibody that is capable of 
reacting with a unique, naturally-occurring Helicobacter polypeptide. An
antibody of the invention can be polyclonal or monoclonal. Monospecific
antibodies can be recombinant, e.g., chimeric (e.g., consisting of a variable
region of murine origin and a human constant region), humanized (e.g., a
human immunoglobulin constant region and a variable region of animal, e.g.,
murine, origin), and/or single chain. Both polyclonal and monospecific
antibodies can also be in the form of immunoglobulin fragments, e.g., F(ab')2
or Fab fragments. The antibodies of the invention can be of any isotype, e.g.,
IgG or IgA, and polyclonal antibodies can be of a single isotype or can contain 
a mixture of isotypes. 

The antibodies of the invention, which can be raised against a
polypeptide or polypeptide derivative of the invention, can be produced and
identified using standard immunological assays, e.g., Western blot assays, dot
blot assays, or ELISA (see, e.g., Coligan et al., Current Protocols in
Immunology, John Wiley & Sons, Inc., New York, NY, 1994). The antibodies
can be used in diagnostic methods to detect the presence of Helicobacter
antigens in a sample, such as a biological sample. The antibodies can also be
used in affinity chromatography methods for purifying a polypeptide or 
polypeptide derivative of the invention. As is discussed further below, the 
antibodies can also be used in prophylactic and therapeutic passive 
immunization methods. 

Accordingly, a ninth aspect of the invention provides (i) a reagent for
detecting the presence of Helicobacter in a biological sample that contains an
antibody, polypeptide, or polypeptide derivative of the invention; and (ii) a
diagnostic method for detecting the presence of Helicobacter in a biological
sample, by contacting the biological sample with an antibody, a polypeptide, or
a polypeptide derivative of the invention, so that an immune complex is
formed, and detecting the complex as an indication of the presence of
Helicobacter in the sample or the organism from which the sample was
derived. The immune complex is formed between a component of the sample
and the antibody, polypeptide, or polypeptide derivative, and any unbound
material can be removed prior to detecting the complex. A polypeptide reagent
can be used for detecting the presence of anti-Helicobacter antibodies in a
sample, e.g., a blood sample, while an antibody of the invention can be used for
screening a sample, such as a gastric extract or biopsy sample, for the presence 
of Helicobacter polypeptides. 

For use in diagnostic methods, the reagent (e.g., the antibody, 
polypeptide, or polypeptide derivative of the invention) can be in a free state or
can be immobilized on a solid support, such as, for example, on the interior
surface of a tube or on the surface, or within pores, of a bead. Immobilization
can be achieved using direct or indirect means. Direct means include passive
adsorption (i.e., non-covalent binding) or covalent binding between the support
and the reagent. By "indirect means" is meant that an anti-reagent compound
that interacts with the reagent is first attached to the solid support. For
example, if a polypeptide reagent is used, an antibody that binds to it can serve
as an anti-reagent, provided that it binds to an epitope that is not involved in
recognition of antibodies in biological samples. Indirect means can also
employ a ligand-receptor system; for example, a molecule, such as a vitamin,
can be grafted onto the polypeptide reagent and the corresponding receptor can
be immobilized on the solid phase. This concept is illustrated by the well
known biotin-streptavidin system. Alternatively, indirect means can be used,
e.g., by adding to the reagent a peptide tail, chemically or by genetic
engineering, and immobilizing the grafted or fused product by passive 
adsorption or covalent linkage of the peptide tail. 

According to a tenth aspect of the invention, there is provided a 
process for purifying from a biological sample a polypeptide or polypeptide
derivative of the invention, which involves carrying out antibody-based affinity
chromatography with the biological sample, wherein the antibody is a 
monospecific antibody of the invention. 

For use in a purification process of the invention, the antibody can be
polyclonal or monospecific, and preferably is of the IgG type. Purified IgGs
can be prepared from an antiserum using standard methods (see, e.g., Coligan
et al., supra). Conventional chromatography supports, as well as standard
methods for grafting antibodies, are described, for example, by Harlow et al.
(Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, New York, 1988).

Briefly, a biological sample, such as an H. pylori extract, preferably 
in a buffer solution, is applied to a chromatography material, which is, 
preferably, equilibrated with the buffer used to dilute the biological sample, so 
that the polypeptide or polypeptide derivative of the invention (i.e., the antigen)
is allowed to adsorb onto the material. The chromatography material, such as a
gel or a resin coupled to an antibody of the invention, can be in batch form or in
a column. The unbound components are washed off and the antigen is eluted
with an appropriate elution buffer, such as a glycine buffer, a buffer containing
a chaotropic agent, e.g., guanidine HCl, or a buffer having high salt
concentration (e.g., 3 M MgCl2). Eluted fractions are recovered and the
presence of the antigen is detected, e.g., by measuring the absorbance at 280
nm.

An antibody of the invention can be screened for therapeutic efficacy
as follows. According to an eleventh aspect of the invention, there is provided 
(i) a composition of matter containing a monospecific antibody of the 
invention, together with a diluent or carrier; (ii) a pharmaceutical composition
containing a therapeutically or prophylactically effective amount of a
monospecific antibody of the invention, and (iii) a method for treating or
preventing Helicobacter (e.g., H. pylori, H. felis, H. mustelae, or H. heilmanii)
infection, by administering a therapeutic or prophylactic amount of a
monospecific antibody of the invention to an individual in need of such
treatment. In addition, the eleventh aspect of the invention includes the use of a
monospecific antibody of the invention in the preparation of a medicament for 
treating or preventing Helicobacter infection. 

The monospecific antibody can be polyclonal or monoclonal, and is, 
preferably, predominantly of the IgA isotype. In passive immunization
methods, the antibody is administered to a mucosal surface of a mammal, e.g.,
the gastric mucosa, e.g., orally or intragastrically, optionally, in the presence of
a bicarbonate buffer. Alternatively, systemic administration, not requiring a
bicarbonate buffer, can be carried out. A monospecific antibody of the
invention can be administered as a single active agent or as a mixture with at
least one additional monospecific antibody specific for a different Helicobacter
polypeptide. The amount of antibody and the particular regimen used can be
readily determined by one skilled in the art. For example, daily administration
of about 100 to 1,000 mg of antibody over one week, or three doses per day of
about 100 to 1,000 mg of antibody over two or three days, can be effective
regimens for most purposes.

Therapeutic or prophylactic efficacy can be evaluated using standard 
methods in the art, e.g., by measuring induction of a mucosal immune response
or induction of protective and/or therapeutic immunity using, e.g., the H. felis
mouse model and the procedures described by Lee et al. (Eur. J.
Gastroenterology & Hepatology 7:303, 1995) or Lee et al. (J. Infect. Dis.
172:161, 1995). Those skilled in the art will recognize that the H. felis strain of
the model can be replaced with another Helicobacter strain. For example, the
efficacy of polynucleotide molecules and polypeptides from H. pylori is,
preferably, evaluated in a mouse model using an H. pylori strain. Protection
can be determined by comparing the degree of Helicobacter infection in the
gastric tissue assessed by, for example, urease activity, bacterial counts, or
gastritis, to that of a control group. Protection is shown when infection is
reduced by comparison to the control group. Such an evaluation can be made
for polynucleotides, vaccine vectors, polypeptides, and polypeptide derivatives, 
as well as for antibodies of the invention. 

For example, various doses of an antibody of the invention can be
administered to the gastric mucosa of mice previously challenged with an H.
pylori strain as described, e.g., by Lee et al. (supra). Then, after an appropriate
period of time, the bacterial load of the mucosa can be estimated by assessing 
urease activity, as compared to a control. Reduced urease activity indicates that 
the antibody is therapeutically effective. 
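
The comparison of urease activity between treated and control animals can be expressed as a percent reduction. The sketch below is illustrative only: the text gives no threshold or statistical test, so the function simply reports the relative change in group means, and the function name and readouts are hypothetical.

```python
from statistics import mean


def percent_reduction(treated, control):
    """Percent reduction of the treated group mean relative to the control group mean."""
    control_mean = mean(control)
    if control_mean <= 0:
        raise ValueError("control mean must be positive")
    return 100.0 * (control_mean - mean(treated)) / control_mean


# Hypothetical urease readouts (arbitrary absorbance units) per mouse
control_group = [0.8, 1.0, 0.6]
treated_group = [0.2, 0.3, 0.1]
print(f"{percent_reduction(treated_group, control_group):.1f}% reduction")
```

A positive value indicates reduced urease activity in the treated group; whether a given reduction is meaningful would still have to be judged with group sizes and variability in mind.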

Adjuvants that can be used in any of the vaccine compositions
described above are described as follows. Adjuvants for parenteral
administration include, for example, aluminum compounds, such as aluminum
hydroxide, aluminum phosphate, and aluminum hydroxy phosphate. The
antigen can be precipitated with, or adsorbed onto, the aluminum compound
using standard methods. Other adjuvants, such as RIBI (ImmunoChem,
Hamilton, MT), can also be used in parenteral administration.

Adjuvants that can be used for mucosal administration include, for
example, bacterial toxins, e.g., the cholera toxin (CT), the E. coli heat-labile
toxin (LT), the Clostridium difficile toxin A, the pertussis toxin (PT), and
combinations, subunits, toxoids, or mutants thereof. For example, a purified
preparation of native cholera toxin subunit B (CTB) can be used. Fragments,
homologs, derivatives, and fusions to any of these toxins can also be used,
provided that they retain adjuvant activity. Preferably, a mutant having
reduced toxicity is used. Suitable mutants are described, e.g., in WO 95/17211
(Arg-7-Lys CT mutant), WO 96/6627 (Arg-192-Gly LT mutant), and WO
95/34323 (Arg-9-Lys and Glu-129-Gly PT mutant). Additional LT mutants
that can be used in the methods and compositions of the invention include, e.g.,
Ser-63-Lys, Ala-69-Gly, Glu-110-Asp, and Glu-112-Asp mutants. Other
adjuvants, such as the bacterial monophosphoryl lipid A (MPLA) of, e.g., E.
coli, Salmonella minnesota, Salmonella typhimurium, or Shigella flexneri;
saponins, and polylactide glycolide (PLGA) microspheres, can also be used in
mucosal administration. Adjuvants useful for both mucosal and parenteral
administration, such as polyphosphazene (WO 95/2415), can also be used.

Any pharmaceutical composition of the invention, containing a 
polynucleotide, polypeptide, polypeptide derivative, or antibody of the
invention, can be manufactured using standard methods. It can be formulated
with a pharmaceutically acceptable diluent or carrier, e.g., water or a saline
solution, such as phosphate buffered saline, optionally, including a bicarbonate
salt, such as sodium bicarbonate, e.g., 0.1 to 0.5 M. Bicarbonate can
advantageously be added to compositions intended for oral or intragastric
administration. In general, a diluent or carrier can be selected on the basis of
the mode and route of administration, and standard pharmaceutical practice.
Suitable pharmaceutical carriers and diluents, as well as pharmaceutical
necessities for their use in pharmaceutical formulations, are described in
Remington's Pharmaceutical Sciences, a standard reference text in this field 
and in the USP/NF. 

The invention also includes methods in which gastroduodenal
infections, such as Helicobacter infection, are treated by oral administration of
a Helicobacter polypeptide of the invention and a mucosal adjuvant, in
combination with an antibiotic, an antisecretory agent, a bismuth salt, an
antacid, sucralfate, or a combination thereof. Examples of such compounds
that can be administered with the vaccine antigen and an adjuvant are
antibiotics, including, e.g., macrolides, tetracyclines, β-lactams,
aminoglycosides, quinolones, penicillins, and derivatives thereof (specific
examples of antibiotics that can be used in the invention include, e.g.,
amoxicillin, clarithromycin, tetracycline, metronidazole, erythromycin, and
cefuroxime); antisecretory agents, including, e.g., H2-receptor
antagonists (e.g., cimetidine, ranitidine, famotidine, nizatidine, and
roxatidine), proton pump inhibitors (e.g., omeprazole, lansoprazole, and
pantoprazole), prostaglandin analogs (e.g., misoprostol and enprostil), and
anticholinergic agents (e.g., pirenzepine, telenzepine, carbenoxolone, and
proglumide); and bismuth salts, including colloidal bismuth subcitrate,
tripotassium dicitrate bismuthate, bismuth subsalicylate, bicitropeptide, and
Pepto-Bismol (see, e.g., Goodwin et al., Helicobacter pylori, Biology and
Clinical Practice, CRC Press, Boca Raton, FL, pp 366-395, 1993; Physicians'
Desk Reference, 49th edn., Medical Economics Data Production Company,
Montvale, New Jersey, 1995). In addition, compounds containing more than
one of the above-listed components coupled together, e.g., ranitidine coupled to
bismuth subcitrate, can be used. The invention also includes compositions for
carrying out these methods, i.e., compositions containing a Helicobacter
antigen (or antigens) of the invention, an adjuvant, and one or more of the
above-listed compounds, in a pharmaceutically acceptable carrier or diluent.

Amounts of the above-listed compounds used in the methods and 
compositions of the invention can readily be determined by one skilled in the 
art. In addition, one skilled in the art can readily design
treatment/immunization schedules. For example, the non-vaccine components
can be administered on days 1-14, and the vaccine antigen + adjuvant can be 
administered on days 7, 14, 21, and 28. 
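
The example schedule above can be laid out as a day-by-day calendar. This is a minimal sketch that encodes only the days named in the example; the function name and the labels are illustrative, and actual schedules would be designed by one skilled in the art.

```python
def combined_schedule(drug_days=range(1, 15), vaccine_days=(7, 14, 21, 28)):
    """Map each treatment day to the list of components administered on that day."""
    schedule = {}
    for day in drug_days:
        schedule.setdefault(day, []).append("non-vaccine components")
    for day in vaccine_days:
        schedule.setdefault(day, []).append("vaccine antigen + adjuvant")
    return dict(sorted(schedule.items()))


for day, items in combined_schedule().items():
    print(f"day {day:2d}: {', '.join(items)}")
```

In this layout, days 7 and 14 are the only days on which both the non-vaccine components and the vaccine antigen with adjuvant are given.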

Methods and pharmaceutical compositions of the invention can be 
used to treat or to prevent Helicobacter infections and, accordingly,
gastroduodenal diseases associated with these infections, including acute,
chronic, and atrophic gastritis, and peptic ulcer diseases, e.g., gastric and 
duodenal ulcers. 

The clones of the invention were originally isolated by a transposon 
shuttle mutagenesis method. Briefly, in this method, a TnMax9 mini-blaM
transposon was used for insertional mutagenesis of an H. pylori gene library
established in E. coli. 192 E. coli clones expressing active β-lactamase fusion
proteins were obtained, indicating that the corresponding target plasmids carry
H. pylori genes encoding extracytoplasmic proteins. Individual mutants were
transferred onto the chromosome of H. pylori P1 or P12 by natural
transformation, resulting in 135 distinct H. pylori mutants. This method is
described in further detail, as follows. 

The transposon TnMax9 (Kahrs et al., Gene 167:53, 1995) was used
to generate mutations in an H. pylori library in E. coli. As illustrated in Fig.
1A, TnMax9 contains, in addition to a catGC-resistance gene close to the
inverted repeat (IR), an unexpressed open reading frame encoding β-lactamase
without a promoter or signal sequence (mature β-lactamase, blaM; Kahrs et al.,
supra). For production of extracytoplasmic BlaM fusion proteins resulting in
ampicillin-resistant (ampR) clones, expression of the cloned H. pylori genes in
E. coli is obligatory. The minimal vector pMin2 (Kahrs et al., supra; see Fig.
1B), containing a weak constitutive promoter (Piga) upstream of the multiple
cloning site, was used for construction of the H. pylori library to ensure
expression of H. pylori genes in E. coli.

In construction of the library, H. pylori DNA was partially digested
with Sau3A and HpaII, size fractionated by preparative agarose gel
electrophoresis, and 3-6 kilobase fragments were ligated into the BglII and ClaI
sites of pMin2. The library was introduced into E. coli strain E181(pTnMax9),
which is a derivative of HB101 containing the TnMax9 transposon, by
electroporation. This generated approximately 2,400 independent
transformants. More than 95% of the plasmids contained an insert of between
3 and 6 kilobases, showing that the 1.7 megabase H. pylori chromosome was
statistically covered. Since not every plasmid could be expected to contain a
target gene carrying an export signal, the library was partitioned into a total of
198 pools (24 pools of 20 clones and 174 pools of 11 clones). Using a cotton
swab, either eleven or twenty individual colonies were inoculated in 0.5 ml LB
medium in eppendorf tubes, vortexed, and 100 µl of the suspension was spread
on LB agar plates supplemented with tetracycline and chloramphenicol to
select for maintenance of both plasmids. Insertion of TnMax9 into the target
plasmids was induced with 100 µM isopropyl-β-D-thiogalactoside (IPTG)
separately for each pool (Haas et al., Gene 130:23-31, 1993). Plasmids were
transferred into E145 by triparental mating, in which 25 µl of the donor strain
(E181), 25 µl of the mobilisator (HB101(pRK2013)), and 50 µl of the
recipient strain (E145) were mixed from corresponding bacterial suspensions
(O.D.550 = 10). The matings were performed for 2-3 hours at 37°C on
nitrocellulose filters, which were placed on LB plates. Bacteria were
suspended in 1 ml LB and aliquots were spread on LB plates containing
chloramphenicol, tetracycline, and rifampicin. Each pool gave rise to
chloramphenicol-resistant transconjugates in E145, demonstrating that both
transposition and conjugation were successful. Generally, several thousand
chloramphenicol-resistant transconjugates were obtained, but the number of
ampR colonies varied in different pools, ranging from one to several hundred
colonies. Two ampR colonies from each positive pool were isolated, plasmid
DNA was extracted, and the DNA was characterized by further restriction
analysis. Only those TnMax9 insertions of a single pool that mapped in
obviously different plasmid clones, or in markedly different regions of the same
clone, were used further.
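
The library figures given above can be checked with a short calculation. The pool sizes, clone number, insert sizes, and genome size come from the text. The coverage estimate uses the Clarke-Carbon approximation, P = 1 - (1 - f)^N, which is an assumption introduced here for illustration (it treats inserts as random, equally likely fragments) and is not stated to be the method used to conclude that the chromosome was statistically covered. The function names are hypothetical.

```python
def pooled_clones(pools):
    """Total number of clones across pools given as (number_of_pools, clones_per_pool)."""
    return sum(n_pools * clones for n_pools, clones in pools)


def coverage_probability(n_clones, insert_kb, genome_kb):
    """Clarke-Carbon estimate of the chance that a given sequence is in the library."""
    fraction = insert_kb / genome_kb  # share of the genome carried by one insert
    return 1.0 - (1.0 - fraction) ** n_clones


pools = [(24, 20), (174, 11)]
print("pools:", sum(n for n, _ in pools))        # 198 pools
print("clones pooled:", pooled_clones(pools))    # 2394, i.e. approximately 2,400

for insert_kb in (3, 6):
    p = coverage_probability(2400, insert_kb, 1700)
    print(f"{insert_kb} kb inserts: P(covered) = {p:.4f}")
```

Under this approximation, both the smallest and the largest stated insert sizes give a high probability that any given chromosomal sequence is represented at least once.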

From 158 of the 198 pools, ampicillin-resistant E145 transconjugates 
were obtained (80%), showing that in several pools, TnMax9 inserted into
expressed genes, resulting in production of extracytoplasmic BlaM fusion
proteins. Thus, a total of 192 ampR E145 clones could be isolated by conjugal
transfer of plasmids from 198 pools. 

To analyze the mutant library, it was determined whether defined 
gene sequences inactivated by TnMax9 were represented once or several times
in the whole library. Five transposon-containing plasmids conferring an ampR
phenotype to E145 (pMu7, pMu13, pMu75, pMu94, and pMu110) were
randomly selected and DNA fragments flanking the TnMax9 insert were
isolated and used as probes in Southern hybridization of 120 ampR clones. The
hybridization probes isolated from clones pMu7, pMu75, and pMu94 were
between 0.9 and 1.1 kilobases in size, and hybridized exclusively with the
inserts of the homologous plasmids. In contrast, the TnMax9 flanking regions
of clones pMu13 and pMu110 were 4.0 and 5.5 kilobases, respectively. They
each hybridized with the homologous plasmids, and with one additional clone
of the library. Such a result was expected, since the longer the hybridization
probe, the higher the chance that it finds a homologous sequence in the
library.

In order to verify the insertion of the transposon into distinct ORFs
encoding putative exported proteins, the TnMax9-flanking DNA of five
representative ampR mutant clones (pMu7, pMu12, pMu18, pMu20, and
pMu26) was sequenced, taking advantage of the M13 forward and reverse
primers on TnMax9 (Fig. 1A). This analysis revealed that the mini-transposon
was inserted into different sequences in each plasmid, thereby interrupting
ORFs encoding putative proteins. For two clones, the sequences located
upstream of the blaM gene revealed a putative ribosome-binding site and a
potential translational start codon (ATG). Other clones either revealed an ORF
spanning the complete sequence (approximately 400 base pairs upstream and
downstream of the TnMax9 insertion) or terminating shortly after the site of
TnMax9 insertion. The partial protein sequences from different ORFs were 
used for database searches, but no significant homologies with known proteins 
were found. 

In a further approach, it was determined whether a known gene, like
vacA, encoding the extracellular vacuolating cytotoxin of H. pylori, could be
identified using this method and how often such a mutation would be
represented in the mutant library. Total cell lysates of the 135 mutants were
tested in an immunoblot using the H. pylori cytotoxin-specific rabbit antiserum
AK197 (Schmitt et al., Mol. Microbiol. 12:307-319, 1994). Two mutants were
identified that no longer produced the cytotoxin antigen (mutants P1-26 and
P1-47) and partial DNA sequencing of the insertion sites revealed that TnMax9
was inserted at distinct positions in the vacA gene, 56 and 53 codons
downstream of the ATG start codon. 

Thus, the characterization of the mutant collection confirmed that a 
representative gene library was constructed in E. coli, in which target genes 
encoding exported H. pylori proteins were efficiently tagged by TnMax9.

In order to establish a collection of mutants lacking distinct exported 
proteins, the mutations had to be transferred back into the H. pylori 
chromosome. By means of natural transformation, 86 plasmids could be 
transformed into the original strain P1. H. pylori strains P1 or P12, which were
naturally competent for DNA transformation, were transformed with circular
plasmid DNA (0.2-0.5 µg/transformation). Transformations to streptomycin
resistance were performed with chromosomal DNA (1 µg/transformation),
isolated from a streptomycin-resistant NCTC11637 H. pylori mutant according
to the procedure described in Haas et al. (Mol. Microbiol. 8:753-760).
Selection was performed on serum plates containing 4 µg/ml chloramphenicol
or 500 µg/ml streptomycin. The transformation frequency for a given mutant
was calculated as the number of chloramphenicol-, streptomycin-, or
erythromycin-resistant colonies per cfu (average of three experiments). The
blaM gene was deleted by NotI digestion, and the plasmid religated, in those
plasmids that did not transform strain P1 directly. This procedure, which
resulted in a twenty- to thirty-fold higher frequency of transformation, as
compared to the same plasmid containing blaM, resulted in 36 additional
mutant P1 strains. The blaM-deletion plasmids that still did not transform
strain P1 were used to transform the heterologous H. pylori strain P12,
possessing an approximately 10-fold higher transformation frequency
compared to P1. This resulted in thirteen further mutants.

Thus, from the 192 ampR plasmids, a total of 135 H. pylori mutants
(122 mutants in PI and 13 mutants in PI 2) were finally obtained by selection 
for chloramphenicol resistance (70%). The transformation frequency varied 
between different plasmids in the range of lxl 0' 5 - lxl 0~ 7 . The remaining 
5 plasmids did not result in any transformants. The collection was frozen as 

individual mutants in stock cultures at -70 °C. To verify the correct insertion of 
the mini-transposon into the H. pylori chromosome, ten representative mutants 
were tested by Southern hybridization of chromosomal DNA using cat GC DNA 
and the vector pMin2 as probes. Consistent with our previous experience 
1 0 concerning TnMaxP-based shuttle mutagenesis of H. pylori, the mini- 
transposon was, in all cases, inserted into the chromosome without integration 
of the vector DNA, which probably means by a double cross-over, rather than 
by a single cross-over event. As judged from the hybridization pattern obtained 
with the cat gene as a probe, it appears that TnMax9 is located in different 
1 5 regions of the chromosome, showing that distinct target genes have been 
interrupted in individual mutants. 
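
The frequency calculation described above (resistant colonies per cfu, averaged over three experiments) can be sketched as follows. This is only an illustration: the function names and the replicate counts are hypothetical, and the only figures taken from the text are the 135 mutants obtained from 192 plasmids.

```python
from statistics import mean


def transformation_frequency(resistant_colonies, total_cfu):
    """Resistant colonies per viable cell (cfu) for one experiment."""
    if total_cfu <= 0:
        raise ValueError("total_cfu must be positive")
    return resistant_colonies / total_cfu


def mean_frequency(experiments):
    """Average frequency over replicate (colonies, cfu) pairs."""
    return mean(transformation_frequency(c, n) for c, n in experiments)


# Hypothetical counts from three replicate transformations.
replicates = [(120, 2.0e8), (95, 1.5e8), (150, 3.0e8)]
freq = mean_frequency(replicates)

# Yield reported in the text: 135 mutants from 192 plasmids.
yield_percent = 100 * 135 / 192
```

With these made-up counts the mean frequency falls inside the 1 x 10^-5 to 1 x 10^-7 range quoted above, and the yield rounds to the stated 70%.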

The mutants were analyzed for motility, transformation competence,
and adherence to KatoIII cells. Screening of the H. pylori mutant collection
allowed identification of mutants impaired in motility, natural transformation
competence, and adherence to gastric epithelial cell lines. Motility mutants
could be grouped into distinct classes: (i) mutants lacking the major flagellin
subunit FlaA and intact flagella; (ii) mutants with apparently normal flagella,
but reduced motility; and (iii) mutants with obviously normal flagella, but
completely abolished motility. Two independent mutations, which exhibited
defects in natural competence for genetic transformation, mapped to different
genetic loci. In addition, two independent mutants were isolated by their
failure to bind to the human gastric carcinoma cell line KatoIII. Both mutants
carried a transposon in the same gene, approximately 0.8 kilobases apart, and
showed decreased autoagglutination, when compared to the wild type strain.

Sequences of clones obtained using the above-described transposon
shuttle mutagenesis method were used to identify intact genes, lacking inserted
transposons, in the H. pylori genome, as is described below in Example 5.

The invention is further illustrated by the following examples.
Example 1 describes identification of genes, such as genes that encode the
polypeptides of the invention, in the Helicobacter genome, as well as
identification of signal sequences and primer design for amplification of genes
lacking signal sequences. Example 2 describes cloning of DNA encoding
GHPO 732, GHPO 419, GHPO 1398, GHPO 706, GHPO 1190, GHPO 986,
GHPO 1420, GHPO 1299, and GHPO 13 into a vector that provides a histidine
tag, and production and purification of the resulting his-tagged fusion proteins.
Example 3 describes methods for cloning DNA encoding the polypeptides of
the invention so that they can be produced without his-tags, and Example 4
describes methods for purifying recombinantly produced polypeptides of the
invention. Example 5 describes methods for obtaining the nucleic acids of the
invention from the deposited clones. Example 6 describes purification of
recombinant H. pylori antigen GHPO 1190.

EXAMPLE 1: Identification of genes in the H. pylori genome,
identification of signal sequences, and primer design for amplification of
genes lacking signal sequences

1.A. Creating H. pylori genomic databases

The H. pylori genome was provided as a text file containing a single
contiguous string of nucleotides that had been determined to be 1.76
Megabases in length. The complete genome was split into 17 separate files
using the program SPLIT (Creativity in Action), giving rise to 16 contigs, each
containing 100,000 nucleotides, and a 17th contig containing the remaining
76,000 nucleotides. A header was added to each of the 17 files using the
format: >hpg0.txt (representing contig 1), >hpg1.txt (representing contig 2), etc.
The resulting 17 files, named hpg0 through hpg16, were then copied together to
form one file that represented the plus strand of the complete H. pylori genome.
The constructed database was given the designation "H." A negative strand
database of the H. pylori genome was created similarly by first creating a
reverse complement of the positive strand using the program SeqPup (D.G.
Gilbert, Indiana University Biology Department) and then performing the same
procedure as described above for the plus strand. This database was given the
designation "N."
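
The database construction described above can be approximated in a few lines of Python. This is a sketch under stated assumptions, not the SPLIT or SeqPup programs themselves: the function names are hypothetical, and a repetitive toy sequence stands in for the genome. Its length is chosen to match the contig sizes quoted above (16 x 100,000 + 76,000 nucleotides).

```python
def split_into_contigs(genome, size=100_000):
    """Cut a sequence into consecutive chunks of `size` nucleotides;
    the last chunk holds whatever remains."""
    return [genome[i:i + size] for i in range(0, len(genome), size)]


_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Reverse complement of a DNA string (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def to_fasta(contigs, prefix="hpg"):
    """Join contigs into one FASTA text with headers >hpg0.txt, >hpg1.txt, ..."""
    records = [
        f">{prefix}{index}.txt\n{contig}"
        for index, contig in enumerate(contigs)
    ]
    return "\n".join(records) + "\n"


# Toy stand-in for the genome (1,676,000 nucleotides).
genome = "ACGT" * 419_000
plus_contigs = split_into_contigs(genome)
minus_contigs = split_into_contigs(reverse_complement(genome))
database_h = to_fasta(plus_contigs)   # plus strand, designation "H"
database_n = to_fasta(minus_contigs)  # minus strand, designation "N"
```

Splitting at fixed offsets can cut a gene in two at a contig boundary, so any search that reports a hit near the end of a contig is worth repeating against the joined neighbouring contigs.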

The regions predicted to encode open reading frames (ORFs) were
defined for the complete H. pylori genome using the program GENEMARK™
(Borodovsky et al., Comp. Chem. 17:123, 1993). A database was created from
a text file containing an annotated version of all ORFs predicted to be encoded
by the H. pylori genome for both the plus and minus strands, and was given the
designation "O." Each ORF was assigned a number indicating its location on
the genome and its position relative to other genes. No manipulation of the text
file was required.

1.B. Searching the H. pylori databases

The databases constructed as is described above were searched using
the program FASTA (Pearson et al., Proc. Natl. Acad. Sci. USA 85:2444-2448,
1988). FASTA was used for searching either a DNA sequence against either of
the gene databases ("H" and/or "N"), or a peptide sequence against the ORF
library ("O"). TFASTX was used to search a peptide sequence against all
possible reading frames of a DNA database ("H" and/or "N" libraries).
FASTX, which also resolves potential frameshifts, was used for searching the
translated reading frames of a DNA sequence against either a DNA database, or
a peptide sequence against the protein database.

1.C. Isolation of DNA sequences from the H. pylori genome

The FASTA searches against the constructed DNA databases
identified exact nucleotide coordinates on one or more of the isolated contigs,
and therefore the location of the target DNA. Once the exact location of the
target sequence was known, the contig identified to carry the gene was
exported into the software package MapDraw (DNAStar, Inc.) and the gene
was isolated. Gene sequences with flanking DNA were then excised and
copied into the EditSeq software package (DNAStar, Inc.) for further analysis.

1.D. Identification of signal sequences

The deduced protein encoded by a target gene sequence was
analyzed using the PROTEAN software package (DNAStar, Inc.). This
analysis predicts those areas of the protein that are hydrophobic by using the
Kyte-Doolittle algorithm, and identifies any potential polar residues preceding
the hydrophobic core region, which is typical for many signal sequences. For
confirmation, the target protein was then searched against a PROSITE database
(DNAStar, Inc.) consisting of motifs and signatures. Predicted prokaryotic
lipid attachment sites are characteristic of many signal sequences, and of
hydrophobic regions in general. Where the two approaches agreed at the
N-terminus of a protein, putative cleavage sites were sought. Specifically,
this includes the presence of either an Alanine (A), Serine (S), or Glycine
(G) residue immediately after the core hydrophobic region. In the case of
lipoproteins, a Cysteine (C) residue would be identified
as the +1 residue, post-cleavage.
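
The hydropathy scan and cleavage-site check described above can be illustrated with a short sketch. It is not the PROTEAN implementation: the window length, threshold, search span, function names, and the example protein are all assumptions chosen for illustration. Only the Kyte-Doolittle scale values and the A/S/G and +1 Cys rules come from the published scale and the text, respectively.

```python
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=9):
    """Mean Kyte-Doolittle score for each full window along the protein."""
    scores = [KYTE_DOOLITTLE[aa] for aa in protein]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


def hydrophobic_core(protein, window=9, threshold=1.6, search_limit=40):
    """Return (start, end) of the first N-terminal window whose mean score
    exceeds `threshold`, or None. Positions are 0-based, end-exclusive."""
    profile = hydropathy_profile(protein[:search_limit], window)
    for start, score in enumerate(profile):
        if score > threshold:
            return start, start + window
    return None


def candidate_cleavage_residues(protein, core_end, span=8):
    """Positions of A, S, or G within `span` residues after the core, each
    paired with a flag for a lipoprotein-style Cys at the next position."""
    hits = []
    for pos in range(core_end, min(core_end + span, len(protein) - 1)):
        if protein[pos] in "ASG":
            hits.append((pos, protein[pos + 1] == "C"))
    return hits


# Hypothetical protein: polar N-terminus, leucine-rich core, then A-G-C.
protein = "MKKR" + "L" * 9 + "AGC" + "DEKNQDEKNQ"
core = hydrophobic_core(protein)
sites = candidate_cleavage_residues(protein, core[1])
```

A window-based scan only flags a region that overlaps the hydrophobic core; the exact core boundaries, and therefore the cleavage site, still need to be checked by eye or against a motif database, as the text describes.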

1.E. Rational design of PCR primers based on the identification of signal
sequences

To clone gene sequences as N-terminal translational fusions for the
generation of recombinant proteins with N-terminal Histidine tags, the gene
sequence that specifies the signal sequence is omitted. The 5'-end of the
gene-specific portion of the N-terminal primer is designed to start at the first codon
beyond the cleavage site. In the case of lipoproteins, the 5'-end of the
N-terminal primer begins at the second codon, immediately after the modifiable
residue at position +1 post-cleavage. The omission of the signal sequence from
the recombinant allows for one-step purification, and potential problems
associated with insertion of signal sequences in the membrane of the host strain
carrying the hybrid construct are avoided.
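
The primer design rule above can be sketched as follows. The clamp and the BamHI and XhoI sites match those used for the primers in Example 2; everything else is an assumption made for illustration, including the function names, the 21-nucleotide gene-specific length, the toy ORF, and the choice to include the stop codon in the C-terminal primer.

```python
CLAMP = "GCC"
BAMHI = "GGATCC"
XHOI = "CTCGAG"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def n_terminal_primer(orf, signal_codons, gene_specific_len=21,
                      lipoprotein=False):
    """Forward primer whose gene-specific part starts at the first codon
    after the cleavage site. `signal_codons` counts the codons removed with
    the signal sequence; lipoproteins skip one more codon (the +1 residue)."""
    first_codon = signal_codons + (1 if lipoprotein else 0)
    start = 3 * first_codon
    return CLAMP + BAMHI + orf[start:start + gene_specific_len]


def c_terminal_primer(orf, gene_specific_len=21):
    """Reverse primer: clamp, XhoI site, then the reverse complement of the
    3' end of the ORF."""
    return CLAMP + XHOI + reverse_complement(orf[-gene_specific_len:])


# Toy ORF: a 21-codon signal sequence, Cys at +1, a short mature region, stop.
orf = ("ATGAAAAAA" + "CTG" * 17 + "GCT"
       + "TGT" + "GATGAAGGTACCCATCAGGTTAACCGT" + "TAA")
forward = n_terminal_primer(orf, signal_codons=21, lipoprotein=True)
reverse = c_terminal_primer(orf)
```

Real primer design would also balance melting temperatures and check that neither restriction site occurs inside the amplified region; those checks are left out of this sketch.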

EXAMPLE 2: Preparation of isolated DNA encoding GHPO 732, GHPO
419, GHPO 1398, GHPO 706, GHPO 1190, GHPO 986, GHPO 1420,
GHPO 1299, and GHPO 13, and production of these polypeptides as
histidine-tagged fusion proteins

2.A. Preparation of genomic DNA from Helicobacter pylori

Helicobacter pylori strain ORV2001, stored in LB medium
containing 50% glycerol at -70 °C, is grown on Columbia agar containing 7%
sheep blood for 48 hours under microaerophilic conditions (8-10% CO2, 5-7%
O2, 85-87% N2). Cells are harvested, washed with phosphate-buffered saline
(PBS) (pH 7.2), and DNA is then extracted from the cells using the Rapid Prep
Genomic DNA Isolation kit (Pharmacia Biotech).

2.B. PCR amplification

DNA molecules encoding GHPO 732, GHPO 419, GHPO 1398,
GHPO 706, GHPO 1190, GHPO 986, GHPO 1420, GHPO 1299, and GHPO
13 are amplified from genomic DNA, as can be prepared as is described above,
by the Polymerase Chain Reaction (PCR) using the following primers:

GHPO 732 (HPO 64):

N-terminal primer:
5'-GCCGGATCCATGACTTATGGGTATGGGGAA-3' (SEQ ID NO: 171);
and
C-terminal primer:
5'-GCCCTCGAGACTTTTATTGATTCACCATTTCATT-3' (SEQ ID NO: 172).

GHPO 419 (HPO 54):

N-terminal primer:
5'-GCCGGATCCATCGCTGAAGAAAATGGGGCG-3' (SEQ ID NO: 173);
and
C-terminal primer:
5'-GCCCGGCCGCCCTAAAAACTATAAACATAACTC-3' (SEQ ID NO: 174).

GHPO 1398 (HPO 15):

N-terminal primer:
5'-GCCGGATCCGGTATTAGGAAGCTTATACCATC-3' (SEQ ID NO: 175);
and
C-terminal primer:
5'-GCCCTCGAGAAGTTCTATTTTTAATTCCTTGAGAG-3' (SEQ ID NO: 176).

GHPO 706 (HPO 50):




N-terminal primer:
5'-GCCGGATCCTCTGATAGCCATAAAGAAAAAAAGGAC-3' (SEQ ID NO: 177);
and
C-terminal primer:
5'-GCCCTCGAGATCTTTAGAAATCAACCCCCAAAGC-3' (SEQ ID NO: 178).

GHPO 1190 (HPO 76):

N-terminal primer:
5'-GCCGGATCCGACTTAGAACATTTTAACACGCTC-3' (SEQ ID NO: 179);
and
C-terminal primer:
5'-GCCCTCGAGTCATTTTAAACGACTCAAAACAAA-3' (SEQ ID NO: 180).

GHPO 986:

N-terminal primer:
5'-GCCGGATCCGGCCAAAGCGTGCGCACTTATTGG-3' (SEQ ID NO: 181);
and
C-terminal primer:
5'-GCCCTCGAGTTATTGTTCCAACCCCCACGCATC-3' (SEQ ID NO: 182).

GHPO 1420:

N-terminal primer:
5'-GCCGGATCCAAGAGCAATGCTGATGACAAACC-3' (SEQ ID NO: 183);
and
C-terminal primer:
5'-GCCCTCGAGTTATGAGTTAAAGCCCCTTGTCC-3' (SEQ ID NO: 184).

GHPO 1299:

N-terminal primer:
5'-GCCGGATCCGAATCAGTAAAAACAGGAAAAAC-3' (SEQ ID NO: 185);
and
C-terminal primer:
5'-GCCCTCGAGCGGCTCTTTGGAGTTTTATTG-3' (SEQ ID NO: 186).

GHPO 13:

N-terminal primer:
5'-GCCGGATCCATCATTCCCTCTCGCTCTATGG-3' (SEQ ID NO: 187);
and
C-terminal primer:
5'-GCCCTCGAGACCTTAATGCGTTGCGTTTTCTTT-3' (SEQ ID NO: 188).

The N-terminal and C-terminal primers for each clone both include a
5' clamp and a restriction enzyme recognition sequence for cloning purposes
(BamHI (GGATCC) and XhoI (CTCGAG) or NotI (CGGCCG) recognition
sequences). The N-terminal primer is designed so that the amplified product
does not encode the signal sequence and the potential cleavage site.
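
The layout shared by these primers (a GCC clamp, then a recognition sequence, then the gene-specific part) can be checked mechanically. The sketch below uses four of the primers listed above and the recognition sequences exactly as the text gives them; the function and dictionary names are hypothetical.

```python
CLAMP = "GCC"

# Recognition sequences as stated in the text.
SITES = {
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
    "NotI": "CGGCCG",
}

PRIMERS = {
    "SEQ ID NO: 171": "GCCGGATCCATGACTTATGGGTATGGGGAA",
    "SEQ ID NO: 172": "GCCCTCGAGACTTTTATTGATTCACCATTTCATT",
    "SEQ ID NO: 173": "GCCGGATCCATCGCTGAAGAAAATGGGGCG",
    "SEQ ID NO: 174": "GCCCGGCCGCCCTAAAAACTATAAACATAACTC",
}


def describe_primer(seq):
    """Return (site_name, gene_specific_part) if `seq` is laid out as
    clamp + site + gene-specific sequence, otherwise None."""
    if not seq.startswith(CLAMP):
        return None
    body = seq[len(CLAMP):]
    for name, site in SITES.items():
        if body.startswith(site):
            return name, body[len(site):]
    return None


layout = {name: describe_primer(seq) for name, seq in PRIMERS.items()}
```

Running the same check over a full primer set is a quick way to catch transcription errors such as a missing clamp or a mistyped site before ordering oligonucleotides.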

Amplification of gene-specific DNA is carried out using Pwo DNA
Polymerase (Boehringer Mannheim), which is a proof-reading polymerase,
according to general guidance provided by the manufacturer. Because of the
exonuclease activity of the polymerase, two reaction mixtures (mixtures 1 and
2) are first prepared separately and combined just prior to amplification. These
mixtures are as follows:

Ingredient (final conc.)        Mixture 1 (µl)    Mixture 2 (µl)
distilled H2O                   160               79
dNTPs (200 µM each)             40                -
10x PCR buffer                  -                 20
primers (100 nM each)           1                 -
DNA template (200 ng),          2                 -
  as obtained in 2.A.

(10x PCR buffer contains 100 mM Tris-HCl (pH 8.85), 250 mM KCl, 50 mM
(NH4)2SO4, 20 mM MgSO4)

Amplification is carried out as follows:

Cycling conditions         Temp (°C)    Time (min.)    Number of cycles
Initial denaturing step    96           4              1
Denaturing step            94           0.5            20
Annealing step             50           1              20
Extension step             72           1              20
Final extension step       72           5              1
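
As a worked check of the program above, the total programmed hold time can be added up. The class and function names are hypothetical; the temperatures, times, and cycle numbers are those in the table. Ramping between temperatures is instrument-dependent and is not included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    minutes: float
    cycles: int


PROGRAM = [
    Step("initial denaturing", 96, 4, 1),
    Step("denaturing", 94, 0.5, 20),
    Step("annealing", 50, 1, 20),
    Step("extension", 72, 1, 20),
    Step("final extension", 72, 5, 1),
]


def hold_time_minutes(program):
    """Sum of programmed hold times, ignoring ramp times."""
    return sum(step.minutes * step.cycles for step in program)


total = hold_time_minutes(PROGRAM)  # 4 + 20 * 2.5 + 5 = 59 minutes
```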

2.C. Transformation and selection of transformants

A single PCR product is thus amplified and is then digested at 37°C
for 2 hours with BamHI and XhoI or NotI concurrently in a 20 µl reaction
volume. The digested product is ligated to similarly cleaved pET28a
(Novagen) that is dephosphorylated prior to the ligation by treatment with Calf
Intestinal Alkaline Phosphatase (CIP). The gene fusion constructed in this
manner allows one-step affinity purification of the resulting fusion protein
because of the presence of histidine residues at the N-terminus of the fusion
protein, which are encoded by the vector.

The ligation reaction (20 µl) is carried out at 14°C overnight and
then is used to transform 100 µl fresh E. coli XL1-Blue competent cells
(Novagen). The cells are incubated on ice for 2 hours, heat-shocked at 42 °C
for 30 seconds, and returned to ice for 90 seconds. The samples are then added
to 1 ml LB broth in the absence of selection and grown at 37 °C for 2 hours.
The cells are plated out on LB agar containing kanamycin (50 µg/ml) at a 10x
and neat dilution and incubated overnight at 37 °C. The following day, 50
colonies are picked onto secondary plates and incubated at 37 °C overnight.

Five colonies are picked into 3 ml LB broth supplemented with
kanamycin (100 µg/ml) and are grown overnight at 37°C. Plasmid DNA is
extracted using the Qiagen mini-prep method and is quantitated by agarose
gel electrophoresis.

PCR is performed with the gene-specific primers under the
conditions set forth above and transformant DNA is confirmed to contain the
desired insert. If PCR-positive, one of the five plasmid DNA samples (500 ng)
extracted from the E. coli XL1-Blue cells is used to transform competent BL21
(λDE3) E. coli cells (Novagen; as described previously).
Transformants (10) are picked onto selective LB agar plates containing
kanamycin (50 µg/ml) and stored as a research stock in LB containing 50% glycerol.

2.D. Purification of recombinant proteins

One ml of frozen glycerol stock prepared as described in 2.C. is used to
inoculate 50 ml of LB medium containing 25 µg/ml of kanamycin in a 250 ml
Erlenmeyer flask. The flask is incubated at 37°C for 2 hours or until the
absorbance at 600 nm (OD600) reaches 0.4-1.0. Growth is stopped by
placing the flask at 4°C overnight. The following day, 10 ml of the
overnight culture are used to inoculate 240 ml LB medium containing
kanamycin (25 µg/ml), with the initial OD600 about 0.02-0.04. Four flasks are
inoculated for each ORF. The cells are grown to an OD600 of 1.0 (about 2
hours at 37°C), a 1 ml sample is harvested by centrifugation, and the sample is
analyzed by SDS-PAGE to detect any leaky expression. The remaining culture
is induced with 1 mM IPTG and the induced cultures are grown for an
additional 2 hours at 37°C.
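
The starting OD600 quoted above follows from the dilution: 10 ml of starter culture in 240 ml of medium is a 25-fold dilution. A minimal sketch, assuming that OD600 scales linearly with cell density and that the starter culture keeps its OD600 during the overnight hold at 4°C (the function name is hypothetical):

```python
def diluted_od(starting_od, inoculum_ml, medium_ml):
    """OD600 expected immediately after adding the inoculum to fresh medium."""
    return starting_od * inoculum_ml / (inoculum_ml + medium_ml)


low = diluted_od(0.4, 10, 240)   # about 0.016
high = diluted_od(1.0, 10, 240)  # about 0.04
```

A starter culture at the low end of the 0.4-1.0 range therefore gives a slightly lower starting density than the approximate 0.02-0.04 stated in the text.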

The final OD600 is taken and the cells are harvested by centrifugation at
5,000 x g for 15 minutes at 4°C. The supernatant is discarded and the pellets
are resuspended in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA. Two hundred and
fifty ml of buffer are used for 1 liter of culture and the cells are recovered by
centrifugation at 12,000 x g for 20 minutes. The supernatant is discarded and
the pellets are stored at -45 °C.

2.E. Protein purification

Pellets obtained from 2.D. are thawed and resuspended in 95 ml of 50
mM Tris-HCl (pH 8.0). Pefabloc and lysozyme are added to final
concentrations of 100 µM and 100 µg/ml, respectively. The mixture is
homogenized with magnetic stirring at 5°C for 30 minutes. Benzonase (Merck)
is added at a 1 U/ml final concentration, in the presence of 10 mM MgCl2, to
ensure total digestion of the DNA. The suspension is sonicated (Branson
Sonifier 450) for 3 cycles of 2 minutes each at maximum output. The
homogenate is centrifuged at 19,000 x g for 15 minutes and both the
supernatant and the pellet are analyzed by SDS-PAGE to determine whether
the target protein is located in the soluble or insoluble fraction, as is
described further below.



2.E.1. Soluble fraction

If the target protein is produced in a soluble form (i.e., in the supernatant
obtained in 2.E.), NaCl and imidazole are added to the supernatant to give final
concentrations of 50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, and 10 mM
imidazole (buffer A). The mixture is filtered through a 0.45 µm membrane and
loaded onto an IMAC column (Pharmacia HiTrap chelating Sepharose; 1 ml),
which has been charged with nickel ions according to the manufacturer's
recommendations. After loading, the column is washed with 50 column
volumes of buffer A and the recombinant target protein is eluted with 5 ml of
buffer B (50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, 500 mM imidazole).

The elution profile is monitored by measuring the absorbance of the
fractions at 280 nm. Fractions corresponding to the protein peak are pooled,
dialyzed against PBS containing 0.5 M arginine, filtered through a 0.22 µm
membrane, and stored at -45°C.

2.E.2. Insoluble fraction

If the target protein is expressed in the insoluble fraction (pellets
obtained from 2.E.), purification is conducted under denaturing conditions.
NaCl, imidazole, and urea are added to the resuspended pellet to give final
concentrations of 50 mM Tris-HCl (pH 8.0), 0.5 M NaCl, 10 mM imidazole,
and 6 M urea (buffer C). After complete solubilization, the mixture is filtered
through a 0.45 µm membrane and loaded onto an IMAC column.

The purification procedures on the IMAC column are the same as
described in 2.E.1., except that 6 M urea is included in all buffers used and 10
column volumes of buffer C are used to wash the column after protein loading,
instead of 50 column volumes.

The protein fractions eluted from the IMAC column with buffer D
(buffer C containing 500 mM imidazole) are pooled. Arginine is added to the
solution to a final concentration of 0.5 M and the mixture is dialyzed against PBS
containing 0.5 M arginine and various concentrations of urea (4 M, 3 M, 2 M, 1
M, and 0.5 M) to progressively decrease the concentration of urea. The final
dialysate is filtered through a 0.22 µm membrane and stored at -45°C.

Alternatively, when the above purification process is not sufficiently
efficient, two other processes may be used as follows. A first alternative
involves the use of a mild denaturant, N-octyl glucoside (NOG). Briefly, a
pellet obtained in 2.E. is homogenized in 5 mM imidazole, 500 mM sodium
chloride, 20 mM Tris-HCl (pH 7.9) by microfluidization at a pressure of 15,000
psi and is clarified by centrifugation at 4,000-5,000 x g. The pellet is
recovered, resuspended in 50 mM NaPO4 (pH 7.5) containing 1-2%
weight/volume NOG, and homogenized. The NOG-soluble impurities are removed by
centrifugation. The pellet is extracted once more by repeating the preceding
extraction step. The pellet is dissolved in 8 M urea, 50 mM Tris (pH 8.0). The
urea-solubilized protein is diluted with an equal volume of 2 M arginine, 50
mM Tris (pH 8.0), and is dialyzed against 1 M arginine for 24-48 hours to
remove the urea. The final dialysate is filtered through a 0.22 µm membrane
and stored at -45°C.

A second alternative involves the use of a strong denaturant, such as
guanidine hydrochloride. Briefly, a pellet obtained in 2.E. is homogenized in 5
mM imidazole, 500 mM sodium chloride, 20 mM Tris-HCl (pH 7.9) by
microfluidization at a pressure of 15,000 psi and clarified by centrifugation at
4,000-5,000 x g. The pellet is recovered, resuspended in 6 M guanidine
hydrochloride, and passed through an IMAC column charged with Ni2+. The
bound antigen is eluted with 8 M urea (pH 8.5). Beta-mercaptoethanol is added
to the eluted protein to a final concentration of 1 mM, then the eluted protein is
passed through a Sephadex G-25 column equilibrated in 0.1 M acetic acid.
Protein eluted from the column is slowly added to 4 volumes of 50 mM
phosphate buffer (pH 7.0). The protein remains in solution.

2.F. Evaluation of the protective activity of the purified protein

Groups of 10 Swiss Webster mice (Taconic Labs) are immunized
rectally with 25 µg of the purified recombinant protein, admixed with 1 µg of
cholera toxin (Berna) in physiological buffer. Mice are immunized on days 0,
7, 14, and 21. Fourteen days after the last immunization, the mice are
challenged with H. pylori strain ORV2001 grown in liquid media (the cells are
grown on agar plates, as described in 2.A., and, after harvest, the cells are
resuspended in Brucella broth; the flasks are then incubated overnight at 37 °C).
Fourteen days after challenge, the mice are sacrificed and their stomachs are
removed. The amount of H. pylori is determined by measuring the urease
activity in the stomach and by culture.

2.G. Production of monospecific polyclonal antibodies

2.G.1. Hyperimmune rabbit antiserum

New Zealand rabbits are injected both subcutaneously and
intramuscularly with 100 µg of a purified fusion polypeptide, as obtained in
2.E.1. or 2.E.2., in the presence of Freund's complete adjuvant and in a total
volume of approximately 2 ml. Twenty one and 42 days after the initial
injection, booster doses, which are identical to priming doses, except that
Freund's incomplete adjuvant is used, are administered in the same way.
Fifteen days after the last injection, animal serum is recovered,
decomplemented, and filtered through a 0.45 µm membrane.

2.G.2. Mouse hyperimmune ascites fluid

Ten mice are injected subcutaneously with 10-50 µg of a purified fusion
polypeptide as obtained in 2.E.1. or 2.E.2., in the presence of Freund's complete
adjuvant and in a volume of approximately 200 µl. Seven and 14 days after the
initial injection, booster doses, which are identical to the priming doses, except
that Freund's incomplete adjuvant is used, are administered in the same way.
Twenty one and 28 days after the initial injection, mice receive 50 µg of the
antigen alone intraperitoneally. On day 21, mice are also injected
intraperitoneally with sarcoma 180/TG cells CM26684 (Lennette et al.,
Diagnostic Procedures for Viral, Rickettsial, and Chlamydial Infections, 5th
Ed. Washington DC, American Public Health Association, 1979). Ascites fluid
is collected 10-13 days after the last injection.

EXAMPLE 3: Methods for producing transcriptional fusions lacking His-tags

Methods for amplification and cloning of DNA encoding the
polypeptides of the invention as transcriptional fusions lacking His-tags are
described as follows. Two PCR primers for each clone are designed based
upon the sequences of the polynucleotides that encode them (SEQ ID NOs:
1-169 (odd numbers)). These primers can be used to amplify DNA encoding the
polypeptides of the invention from any Helicobacter pylori strain, including,
for example, ORV2001 and the strain deposited as ATCC deposit number
43579, as well as from other Helicobacter species.

The N-terminal primers are designed to include the ribosome binding
site of the target gene, the ATG start site, and any signal sequence and cleavage
site. The N-terminal primers can include a 5' clamp and a restriction
endonuclease recognition site, such as that for BamHI (GGATCC), which
facilitates subsequent cloning. Similarly, the C-terminal primers can include a
restriction endonuclease recognition site, such as that for XhoI (CTCGAG),
which can be used in subsequent cloning, and a TAA stop codon.

Amplification of genes encoding the polypeptides of the invention is
carried out using Thermalase DNA Polymerase under the conditions described
above in Example 2. Alternatively, Vent DNA polymerase (New England
Biolabs), Pwo DNA polymerase (Boehringer Mannheim), or Taq DNA
polymerase (Appligene) can be used, according to instructions provided by the
manufacturers.

A single PCR product for each clone is amplified and cloned into
appropriately cleaved pET 24 (e.g., BamHI-XhoI cleaved pET 24), resulting in
construction of a transcriptional fusion that permits expression of the proteins
without His-tags. The expressed products can be purified as denatured proteins
that are refolded by dialysis into 1 M arginine.

Cloning into pET 24 allows transcription of the genes from the T7
promoter, which is supplied by the vector, but relies upon binding of the
RNA-specific DNA polymerase to the intrinsic ribosome binding sites of the genes,
and thereby expression of the complete ORF. The amplification, digestion, and
cloning protocols are as described above for constructing translational fusions.

Amplification of clone GHPO 1190 DNA

Design of PCR primers for cloning

Two PCR primers are designed based on the complete gene sequence
(see Table 1). The N-terminal primer (FC1) is designed to include the ribosome
binding site of the target gene, the ATG start site, and the signal sequence (with
cleavage site). It includes a clamp (GCC) at the 5' most end, and a SacI
recognition sequence (GAGCTC) for cloning purposes.

The C-terminal primer (RN2) includes an XhoI recognition sequence for
cloning purposes, and the natural TAA stop codon.

N-terminal primer (FC1):
5'-GCCGAGCTCCAAGCAAAAAAATGTCAATTAAAAGGG-3' (SEQ ID NO: 189)

C-terminal primer (RN2):
5'-GCCCTCGAGGTCTAAATTAGAATAAGTGTTGTT-3' (SEQ ID NO: 190)

Amplification of each specified gene can be achieved by employing 
FC1/RN2 primers for any of the genes described (see Table 1). 



PCR conditions

Amplification of gene-specific DNA is carried out using Pwo DNA
Polymerase (Boehringer Mannheim) under the following conditions. Due to
the exonuclease activity of the polymerase, two reaction mixtures are prepared
separately and combined just prior to amplification.

Reaction ingredients:

Ingredient (final conc.)    Mixture 1 (µl)    Mixture 2 (µl)
distilled H2O               160               79
dNTPs (200 µM each)         40                -
10X buffer                  -                 20
primer 1 (100 nM)           1                 -
primer 2 (100 nM)           1                 -
Template (200 ng)           2                 -

Cycling condition          Temp (°C)    Time (min)    Number of cycles
Initial denaturing step    96           4             1
Denaturing step            94           0.5           20
Annealing step             50           1             20
Extension step             72           1             20
Final extension step       72           1             1



A single PCR product of 624 basepairs is amplified and cloned into
SacI-XhoI cleaved pET 24, allowing construction of a transcriptional fusion
and expression of GHPO 1190 antigen in the absence of a His-tag. In this
instance, expressed product can be purified as a denatured protein that is
refolded by dialysis into 1 M arginine.

Cloning into pET 24 allows transcription from the T7 promoter,
supplied by the vector, but relies upon binding of the RNA-specific DNA
polymerase to the intrinsic ribosome binding site for GHPO 1190, and thereby
expression of the complete ORF. The amplification, restriction, and cloning
protocols are as previously described for constructing translational fusions.

EXAMPLE 4: Purification of the polypeptides of the invention by
immunoaffinity

4.A. Purification of specific IgGs

An immune serum, as prepared in section 2.G., is applied to a protein A
Sepharose Fast Flow column (Pharmacia) equilibrated in 100 mM Tris-HCl
(pH 8.0). The resin is washed by applying 10 column volumes of 100 mM
Tris-HCl and 10 volumes of 10 mM Tris-HCl (pH 8.0) to the column. IgG
antibodies are eluted with 0.1 M glycine buffer (pH 3.0) and are collected as 5
ml fractions to which is added 0.25 ml 1 M Tris-HCl (pH 8.0). The optical
density of the eluate is measured at 280 nm and the fractions containing the IgG
antibodies are pooled, dialyzed against 50 mM Tris-HCl (pH 8.0), and, if
necessary, stored frozen at -70°C.

4.B. Preparation of the column

An appropriate amount of CNBr-activated Sepharose 4B gel (1 g of
dried gel provides for approximately 3.5 ml of hydrated gel; gel capacity is
from 5 to 10 mg coupled IgG/ml of gel) manufactured by Pharmacia
(17-0430-01) is suspended in 1 mM HCl buffer and washed on a Buchner funnel by adding
small quantities of 1 mM HCl buffer. The total volume of buffer is 200 ml per
gram of gel.

Purified IgG antibodies are dialyzed for 4 hours at 20±5°C against
50 volumes of 500 mM sodium phosphate buffer (pH 7.5). The antibodies are
then diluted in 500 mM phosphate buffer (pH 7.5) to a final concentration of 3
mg/ml.
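
The quantities given above fix how much gel, wash buffer, and antibody solution a coupling run needs. The sketch below only restates that arithmetic; the function name and the 2 g example are hypothetical, and the constants are the values quoted in this section.

```python
HYDRATED_ML_PER_G = 3.5        # ml of hydrated gel per gram of dried gel
CAPACITY_MG_PER_ML = (5, 10)   # mg of coupled IgG per ml of gel (low, high)
WASH_ML_PER_G = 200            # ml of 1 mM HCl buffer per gram of gel
IGG_MG_PER_ML = 3              # IgG concentration used for coupling


def coupling_plan(dried_gel_g):
    """Gel volume, wash volume, IgG capacity range, and the corresponding
    volume range of 3 mg/ml IgG solution for a given mass of dried gel."""
    gel_ml = dried_gel_g * HYDRATED_ML_PER_G
    igg_mg = tuple(gel_ml * c for c in CAPACITY_MG_PER_ML)
    return {
        "hydrated_gel_ml": gel_ml,
        "wash_buffer_ml": dried_gel_g * WASH_ML_PER_G,
        "igg_capacity_mg": igg_mg,
        "igg_solution_ml": tuple(m / IGG_MG_PER_ML for m in igg_mg),
    }


plan = coupling_plan(2)  # 2 g of dried gel
```

For 2 g of dried gel this gives 7 ml of hydrated gel, 400 ml of wash buffer, and a capacity of 35-70 mg of IgG, which corresponds to roughly 12-23 ml of the 3 mg/ml antibody solution.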

IgG antibodies are mixed with the gel overnight at 5±3 °C. The gel is
packed into a chromatography column and is washed with 2 column volumes of
500 mM phosphate buffer (pH 7.5), and 1 column volume of 50 mM sodium
phosphate buffer, containing 500 mM NaCl (pH 7.5). The gel is then
transferred to a tube, mixed with 100 mM ethanolamine (pH 7.5) for 4 hours at
room temperature, and washed twice with 2 column volumes of PBS. The gel
is then stored in 1/10,000 PBS/merthiolate. The amount of IgG antibodies
coupled to the gel is determined by measuring the optical density (OD) at 280
nm of the IgG solution and the direct eluate, plus washings.

4.C. Adsorption and elution of the antigen

An antigen solution in 50 mM Tris-HCl (pH 8.0), 2 mM EDTA, for
example, the supernatant obtained in 2.E. or the solubilized pellet obtained in
2.E., after centrifugation and filtration through a 0.45 µm membrane, is applied
to a column equilibrated with 50 mM Tris-HCl (pH 8.0), 2 mM EDTA, at a
flow rate of about 10 ml/hour. The column is then washed with 20 volumes of
50 mM Tris-HCl (pH 8.0), 2 mM EDTA. Alternatively, adsorption can be
achieved by mixing overnight at 5±3°C.

The adsorbed gel is washed with 2 to 6 volumes of 10 mM sodium
phosphate buffer (pH 6.8) and the antigen is eluted with 100 mM glycine buffer
(pH 2.5). The eluate is recovered in 3 ml fractions, to each of which is added
150 µl of 1 M sodium phosphate buffer (pH 8.0). Absorbance is measured at
280 nm for each fraction; those fractions containing the antigen are pooled and
stored at -20°C.

EXAMPLE 5: Preparation off isolated BNA encoding the polypeptides of 
5 the invention from the deposited clones. 

As mentioned above, E. coli strains including plasmids containing 
nucleic acids encoding GHPO 1190 (formerly HPO76, ATCC# 98197), GHPO 
1212 (formerly HPO18, ATCC# 98210), GHPO 1012 (formerly HPO121, 
ATCC# 98201), GHPO 1501 (formerly HPO45, ATCC# 98208), GHPO 1688 
(formerly HPO101, ATCC# 98198), GHPO 346 (formerly HPO116, ATCC# 
98200), GHPO 1200 (formerly HPO7, ATCC# 98211), GHPO 1538 (formerly 
HPO104, ATCC# 98199), GHPO 1398 (formerly HPO15, ATCC# 98214), 
GHPO 1001 (formerly HPO58, ATCC# 98206), GHPO 470 (formerly 
HPO132, ATCC# 98202), GHPO 689 (formerly HPO9, ATCC# 98203), 
GHPO 1550 (formerly HPO38, ATCC# 98204), GHPO 1620 (formerly 
HPO87, ATCC# 98205), GHPO 574 (formerly HPO71, ATCC# 98217), 
GHPO 329 (formerly HPO70, ATCC# 98219), GHPO 1374 (formerly HPO80, 
ATCC# 98215), GHPO 956 (formerly HPO95, ATCC# 98216), HPO 98 
(ATCC# 98218), GHPO 1346 (formerly HPO57, ATCC# 98220), GHPO 706 
(formerly HPO50, ATCC# 98207), GHPO 732 (formerly HPO64, ATCC# 
98213), GHPO 419 (formerly HPO54, ATCC# 98212), and GHPO 276 
(formerly HPO42, ATCC# 98209) were deposited in E. coli strain DH5α under 
the Budapest Treaty with the American Type Culture Collection (ATCC; 
Rockville, Maryland) on October 9, 1996 and were designated with accession 
numbers indicated in parentheses above. These plasmids each contain a 

genomic DNA BglII-ClaI insert from H. pylori strain P1 or P12 (referred to as 
69-A and 888-0 in Haas et al., Mol. Microbiol. (1993) 8:753). Each of the 
inserts is disrupted by the presence of transposon TnMax9 (Kahrs et al., Gene 
(1995) 167:53). DNA molecules lacking the transposon can be amplified from 
the plasmids using standard PCR techniques, such as inverse and recombinant 
PCR (see, e.g., Innis et al., supra), so that a full-length H. pylori insert is 
reconstituted. For example, the H. pylori sequences flanking the transposon 
can each be amplified by PCR, and then ligated together to form the full-length 
H. pylori gene lacking the transposon. Primers that can be used in these 
methods for each of the twenty-four deposited clones of the invention are 
shown in Table 1. The locations of insertion of the transposon in each of the 
deposited clones are between the nucleotides indicated in parentheses after the 
name of each clone, as follows: HPO101 (497-498), GHPO 1538 (428-429), 
GHPO 346 (433-444), GHPO 1012 (463-464), GHPO 132 (408-409), GHPO 
1212 (226-227), GHPO 1550 (347-348), GHPO 276 (372-373), GHPO 1501 
(299-300), GHPO 706 (29-293), GHPO 419 (351-352), GHPO 1346 (266-267), 
GHPO 1001 (434-435), GHPO 732 (224-225), GHPO 329 (114-115), GHPO 
574 (274-275), GHPO 1190 (412-413), GHPO 1200 (349-350), GHPO 1374 
(105-106), GHPO 1620 (26-27), GHPO 956 (64-65), HPO 98 (43-44), and 
GHPO 689 (346-347). 
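
The idea of joining the two flanking sequences can be illustrated in silico. The sketch below is not the PCR procedure itself and uses toy sequences rather than any deposited clone or the actual TnMax9 sequence; the function names and the transposon length passed in are hypothetical, and any target-site duplication created on insertion is ignored. Insertion sites are written as "N-M", meaning the transposon lies between nucleotides N and M of the intact sequence (1-based).

```python
def parse_site(site):
    """Parse 'N-M' into 1-based positions and require that M == N + 1."""
    left, right = (int(part) for part in site.split("-"))
    if right != left + 1:
        raise ValueError(f"positions {left} and {right} are not adjacent")
    return left, right


def disrupt(gene, site, transposon):
    """Model a disrupted insert: the transposon sits between two nucleotides."""
    left, _ = parse_site(site)
    return gene[:left] + transposon + gene[left:]


def reconstitute(disrupted, site, transposon_length):
    """Join the two flanking segments, dropping the transposon between them."""
    left, _ = parse_site(site)
    upstream = disrupted[:left]
    downstream = disrupted[left + transposon_length:]
    return upstream + downstream


toy_gene = "ATGAAACCCGGGTTTTAA"   # toy sequence, not from the listing
toy_tn = "NNNNNN"                 # placeholder for the transposon
interrupted = disrupt(toy_gene, "6-7", toy_tn)
print(interrupted)
print(reconstitute(interrupted, "6-7", len(toy_tn)) == toy_gene)
```

A site that does not name two adjacent positions is rejected rather than guessed at, since the join point would otherwise be ambiguous.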

EXAMPLE 6: Purification of recombinant H. pylori antigen from GHPO 
1190. 

A pellet of E. coli expressing GHPO 1190 is homogenized in 5 mM 
imidazole, 500 mM sodium chloride, 20 mM Tris-HCl (pH 7.9) by 
microfluidization at a pressure of 15,000 psi, and clarified by centrifugation at 
4000-5000g. 
Method 1 

The pellet containing cloned protein is suspended in buffer containing 
2% N-octyl glucoside (NOG) and is homogenized. The NOG soluble protein is 
removed by centrifugation. The pellet is extracted one more time with 2% 
NOG. After centrifugation, the pellet is dissolved in 8 M urea. The 
urea-solubilized protein is diluted with an equal volume of 2 M arginine and 
dialyzed against 1 M arginine for 24-48 hours to remove urea. The cloned 
protein remains in solution. SDS-PAGE and Coomassie staining, followed by 
densitometric scanning, show that the protein is 80-85% pure cloned antigen. 
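
The concentrations that result from the equal-volume dilution in Method 1 can be worked out with a short calculation. The sketch below assumes additive volumes and uses a hypothetical helper name; it shows that mixing the 8 M urea solution with an equal volume of 2 M arginine gives 4 M urea and 1 M arginine, the latter matching the arginine concentration of the dialysis buffer.

```python
def mix(solution_a, volume_a, solution_b, volume_b):
    """Concentrations (M) after combining two solutions, assuming additive volumes.

    Each solution is a dict mapping solute name to molar concentration.
    """
    total = volume_a + volume_b
    solutes = set(solution_a) | set(solution_b)
    return {
        s: (solution_a.get(s, 0.0) * volume_a + solution_b.get(s, 0.0) * volume_b) / total
        for s in solutes
    }


diluted = mix({"urea": 8.0}, 1.0, {"arginine": 2.0}, 1.0)
print(diluted["urea"], diluted["arginine"])
```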
Method 2 

The pellet containing cloned protein is solubilized in 6 M guanidine 
hydrochloride and is passed through an IMAC column charged with Ni²⁺. The 
bound antigen is eluted with 8 M urea (pH 8.5). β-Mercaptoethanol is added to 
the eluted protein to a final concentration of 1 mM, and the protein is then 
passed through a Sephadex G-25 column equilibrated in 0.1 M acetic acid. 
Protein eluted from the Sephadex G-25 column is slowly added to 4 volumes of 
50 mM phosphate (pH 7.0). The protein remains in solution. 

Purification of recombinant proteins 

Recombinant proteins expressed as Histidine-tagged fusion proteins can 
be solubilized and purified by using a metal affinity column (nickel column). 
The bound protein can be eluted with imidazole buffer, with or without urea, or 
by using low pH buffers, with or without urea. Urea or guanidine 
hydrochloride-denatured proteins can then be renatured using appropriate 
renaturing buffers. With a number of recombinant H. pylori antigens (HpaA 
and clone GHPO 1190), renaturation conditions using arginine hydrochloride 
(0.25-1 M) have been determined. 






WO 98/21225 



PCT/US97/21353 




Recombinant proteins without a His-tag can be solubilized and purified 
using immunoaffinity, ion-exchange, sizing, and/or hydrophobic 
chromatography. Proteins expressed as insoluble aggregates in inclusion 
bodies can be solubilized in denaturing agents, such as 8 M urea or 6 M 
guanidine hydrochloride. Appropriate folding and renaturation can readily be 
determined by one skilled in the art. 

The above pellet containing cloned protein is suspended in 50 mM 
NaPO4 (pH 7.5) containing 1% weight/volume N-octyl glucoside (NOG) and 
mixed vigorously. The NOG soluble impurities are removed by centrifugation. 
The remaining pellet is extracted one more time with the 1% NOG solution to 
further remove impurities. After centrifugation, the pellet is solubilized in 8 M 
urea, 50 mM Tris (pH 8.0). The urea-solubilized protein is diluted with an 
equal volume of 2 M arginine, 50 mM Tris (pH 8.0), and is dialyzed against 1 
M arginine, 50 mM Tris, 50 mM NaCl (pH 8.0) for 24-48 hours to remove 
urea. The cloned protein remains in solution following dialysis. SDS-PAGE 
and Coomassie staining followed by densitometric scanning show that the 
protein is 80-85% pure cloned antigen. 

Other embodiments are within the following claims. 

SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: ORAVAX, INC. 
(ii) TITLE OF THE INVENTION: HELICOBACTER POLYPEPTIDES 
    AND CORRESPONDING POLYNUCLEOTIDE MOLECULES 
(iii) NUMBER OF SEQUENCES: 190 
(iv) CORRESPONDENCE ADDRESS: 
    (A) ADDRESSEE: Clark & Elbing LLP 
    (B) STREET: 176 Federal Street 
    (C) CITY: Boston 
    (D) STATE: MA 
    (E) COUNTRY: USA 
    (F) ZIP: 02110-2214 
(v) COMPUTER READABLE FORM: 
    (A) MEDIUM TYPE: Diskette 
    (B) COMPUTER: IBM Compatible 
    (C) OPERATING SYSTEM: DOS 
    (D) SOFTWARE: FastSEQ for Windows Version 2.0 
(vi) CURRENT APPLICATION DATA: 
    (A) APPLICATION NUMBER: UNKNOWN 
    (B) FILING DATE: 14-NOV-1997 
    (C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 
    (A) APPLICATION NUMBER: 08/749,051 
    (B) FILING DATE: 14-NOV-1996 
(vii) PRIOR APPLICATION DATA: 
    (A) APPLICATION NUMBER: 08/831,309 
    (B) FILING DATE: 1-APR-1997 
(vii) PRIOR APPLICATION DATA: 
    (A) APPLICATION NUMBER: 08/834,705 
    (B) FILING DATE: 1-APR-1997 
(vii) PRIOR APPLICATION DATA: 
    (A) APPLICATION NUMBER: 08/833,457 
    (B) FILING DATE: 1-APR-1997 
(vii) PRIOR APPLICATION DATA: 
    (A) APPLICATION NUMBER: 08/881,227 
    (B) FILING DATE: 24-JUN-1997 
(vii) PRIOR APPLICATION DATA: 
    (A) APPLICATION NUMBER: 08/902,615 
    (B) FILING DATE: 29-JUL-1997 
(viii) ATTORNEY/AGENT INFORMATION: 
    (A) NAME: Clark, Paul T. 
    (B) REGISTRATION NUMBER: 30,175 
    (C) REFERENCE/DOCKET NUMBER: 06132/028WO1 
(ix) TELECOMMUNICATION INFORMATION: 
    (A) TELEPHONE: 617-428-0200 
    (B) TELEFAX: 617-428-7045 
    (C) TELEX: 


(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 989 base pairs 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 
(ix) FEATURE: 
    (A) NAME/KEY: Coding Sequence 
    (B) LOCATION: 71...940 
    (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTATGACGAT TGTCTCGCTT TTAGAAAACA CTCTAATCGC TTTTGAAAAA CAACAAAGGA  60
AGGGATTTTA ATG AAA TTT TTA CGC TCT GTT TAT GCA TTT TGC TCC AGT  109
           Met Lys Phe Leu Arg Ser Val Tyr Ala Phe Cys Ser Ser
           1               5                   10

TGG GTA GGG ACG ATT GTT ATT GTG CTG TTG GTT ATC TTT TTT ATC GCG  157
Trp Val Gly Thr Ile Val Ile Val Leu Leu Val Ile Phe Phe Ile Ala
    15                  20                  25

CAA GCC TTT ATC ATT CCC TCT CGC TCT ATG GTT GGC ACG CTC TAT GAG  205
Gln Ala Phe Ile Ile Pro Ser Arg Ser Met Val Gly Thr Leu Tyr Glu
30                  35                  40                  45

GGC GAC ATG CTC TTT GTC AAA AAG TTT TCT TAC GGC ATA CCC ATT CCT  253
Gly Asp Met Leu Phe Val Lys Lys Phe Ser Tyr Gly Ile Pro Ile Pro
                50                  55                  60

AAA ATC CCA TGG ATT GAG CTT CCT GTT ATG CCT GAT TTT AAA AAT AAC  301
Lys Ile Pro Trp Ile Glu Leu Pro Val Met Pro Asp Phe Lys Asn Asn
            65                  70                  75

GGA CAT TTG ATA GAG GGG GAT CGC CCT AAG CGT GGC GAA GTG GTG GTG  349
Gly His Leu Ile Glu Gly Asp Arg Pro Lys Arg Gly Glu Val Val Val
        80                  85                  90

TTT ATC CCT CCC CAT GAA AAA AAG TCT TAC TAT GTT AAA AGG AAT TTT  397
Phe Ile Pro Pro His Glu Lys Lys Ser Tyr Tyr Val Lys Arg Asn Phe
    95                  100                 105

GCC ATT GGA GGC GAT GAG GTG TTG TTC ACT AAT GAG GGT TTT TAT TTG  445
Ala Ile Gly Gly Asp Glu Val Leu Phe Thr Asn Glu Gly Phe Tyr Leu
110                 115                 120                 125

CAC CCT TTT GAG AGC GAC ACG GAC AAA AAT TAC ATC GCT AAA CAT TAC  493
His Pro Phe Glu Ser Asp Thr Asp Lys Asn Tyr Ile Ala Lys His Tyr
                130                 135                 140

CCT AAC GCC ATG ACA AAA GAA TTT ATG GGT AAA ATT TTT GTT TTA AAC  541
Pro Asn Ala Met Thr Lys Glu Phe Met Gly Lys Ile Phe Val Leu Asn
            145                 150                 155

CCT TAT AAA AAT GAG CAT CCG GGT ATC CAT TAC CAA AAA GAC AAT GAA  589
Pro Tyr Lys Asn Glu His Pro Gly Ile His Tyr Gln Lys Asp Asn Glu
        160                 165                 170

ACC TTC CAC TTA ATG GAG CAA TTA GCC ACT CAA GGC GCA GAA GCT AAT  637
Thr Phe His Leu Met Glu Gln Leu Ala Thr Gln Gly Ala Glu Ala Asn
    175                 180                 185

ATC AGC ATG CAA CTC ATT CAA ATG GAG GGC GAA AAG GTG TTT TAT AAG  685
Ile Ser Met Gln Leu Ile Gln Met Glu Gly Glu Lys Val Phe Tyr Lys
190                 195                 200                 205

AAA ATC AAT GAC GAT GAA TTT TTC ATG ATC GGC GAC AAC AGA GAC AAT  733
Lys Ile Asn Asp Asp Glu Phe Phe Met Ile Gly Asp Asn Arg Asp Asn
                210                 215                 220

TCT AGC GAC TCG CGC TTT TGG GGG AGT GTG GCT TAT AAA AAC ATC GTG  781
Ser Ser Asp Ser Arg Phe Trp Gly Ser Val Ala Tyr Lys Asn Ile Val
            225                 230                 235

GGT TCG CCA TGG TTT GTT TAT TTC AGT TTG AGT TTA AAA AAT AGC CTA  829
Gly Ser Pro Trp Phe Val Tyr Phe Ser Leu Ser Leu Lys Asn Ser Leu
        240                 245                 250

GAA ATG GAT GCA GAA AAT AAC CCT AAA AAA CGC TAT CTG GTG CGT TGG  877
Glu Met Asp Ala Glu Asn Asn Pro Lys Lys Arg Tyr Leu Val Arg Trp
    255                 260                 265

GAA CGC ATG TTT AAA AGC GTT GGA GGC TTA GAA AAA ATC ATT AAA AAA  925
Glu Arg Met Phe Lys Ser Val Gly Gly Leu Glu Lys Ile Ile Lys Lys
270                 275                 280                 285

GAA AAC GCA ACG CAT TAAGGTTTTT TGTGCAATTT TTTGATTTCT CTTTAGAAAG T  981
Glu Asn Ala Thr His
                290

TTTATTAC  989

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 290 amino acids 
    (B) TYPE: amino acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Lys Phe Leu Arg Ser Val Tyr Ala Phe Cys Ser Ser Trp Val Gly
1               5                   10                  15

Thr Ile Val Ile Val Leu Leu Val Ile Phe Phe Ile Ala Gln Ala Phe
            20                  25                  30

Ile Ile Pro Ser Arg Ser Met Val Gly Thr Leu Tyr Glu Gly Asp Met
        35                  40                  45

Leu Phe Val Lys Lys Phe Ser Tyr Gly Ile Pro Ile Pro Lys Ile Pro
    50                  55                  60

Trp Ile Glu Leu Pro Val Met Pro Asp Phe Lys Asn Asn Gly His Leu
65                  70                  75                  80

Ile Glu Gly Asp Arg Pro Lys Arg Gly Glu Val Val Val Phe Ile Pro
                85                  90                  95

Pro His Glu Lys Lys Ser Tyr Tyr Val Lys Arg Asn Phe Ala Ile Gly
            100                 105                 110

Gly Asp Glu Val Leu Phe Thr Asn Glu Gly Phe Tyr Leu His Pro Phe
        115                 120                 125

Glu Ser Asp Thr Asp Lys Asn Tyr Ile Ala Lys His Tyr Pro Asn Ala
    130                 135                 140

Met Thr Lys Glu Phe Met Gly Lys Ile Phe Val Leu Asn Pro Tyr Lys
145                 150                 155                 160

Asn Glu His Pro Gly Ile His Tyr Gln Lys Asp Asn Glu Thr Phe His
                165                 170                 175

Leu Met Glu Gln Leu Ala Thr Gln Gly Ala Glu Ala Asn Ile Ser Met
            180                 185                 190

Gln Leu Ile Gln Met Glu Gly Glu Lys Val Phe Tyr Lys Lys Ile Asn
        195                 200                 205

Asp Asp Glu Phe Phe Met Ile Gly Asp Asn Arg Asp Asn Ser Ser Asp
    210                 215                 220

Ser Arg Phe Trp Gly Ser Val Ala Tyr Lys Asn Ile Val Gly Ser Pro
225                 230                 235                 240

Trp Phe Val Tyr Phe Ser Leu Ser Leu Lys Asn Ser Leu Glu Met Asp
                245                 250                 255

Ala Glu Asn Asn Pro Lys Lys Arg Tyr Leu Val Arg Trp Glu Arg Met
            260                 265                 270

Phe Lys Ser Val Gly Gly Leu Glu Lys Ile Ile Lys Lys Glu Asn Ala
        275                 280                 285

Thr His
    290
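
The coding-sequence locations and protein lengths stated in this listing can be cross-checked arithmetically: a coding feature spanning positions start to end (inclusive, stop codon excluded) should contain (end - start + 1) / 3 codons. The sketch below applies that check to the locations and lengths given for SEQ ID NOS: 1/2, 3/4, 5/6, and 7/8; the function and variable names are hypothetical.

```python
def codon_count(start, end):
    """Number of codons in a coding feature spanning start..end (1-based, inclusive)."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"{start}...{end} is {length} nt, not a multiple of 3")
    return length // 3


# (coding start, coding end, stated protein length) as given in the listing
FEATURES = {
    "SEQ ID NO: 1 / 2": (71, 940, 290),
    "SEQ ID NO: 3 / 4": (112, 471, 120),
    "SEQ ID NO: 5 / 6": (135, 1049, 305),
    "SEQ ID NO: 7 / 8": (142, 2682, 847),
}

for name, (start, end, stated) in FEATURES.items():
    status = "consistent" if codon_count(start, end) == stated else "MISMATCH"
    print(name, codon_count(start, end), status)
```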

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 514 base pairs 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 
(ix) FEATURE: 
    (A) NAME/KEY: Coding Sequence 
    (B) LOCATION: 112...471 
    (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GGATTTTTTA GAGCTCTTAG TCAATGATAA TGTGGTAGAA ACGATTGAAA AAGGCTTTGT  60
GATAGGTTTT GGAGCGGGGG ATATTACCTA TCAATTAAGA GGCGAAATGT A ATG GGT  117
                                                         Met Gly
                                                         1

GCA GTG GTT GTT TTA TTT TTA ACG CTG GTT TTA TTG TTT TTA GTT TTA  165
Ala Val Val Val Leu Phe Leu Thr Leu Val Leu Leu Phe Leu Val Leu
        5                   10                  15

AGG GAT TTT GGT TTA GCA AGC CCC AAA CAA AAG ATT TTA GCT TTT TTA  213
Arg Asp Phe Gly Leu Ala Ser Pro Lys Gln Lys Ile Leu Ala Phe Leu
    20                  25                  30

ATC GTA GGG ATT ATA GGA GCG AGC ATC AGC GTT TAT ACT TAC AAG CAA  261
Ile Val Gly Ile Ile Gly Ala Ser Ile Ser Val Tyr Thr Tyr Lys Gln
35                  40                  45                  50

AAC CAA CAA AAC CAA CAA GAG ATC GCT TTG CAA AGA GCG TTT TTA AGG  309
Asn Gln Gln Asn Gln Gln Glu Ile Ala Leu Gln Arg Ala Phe Leu Arg
                55                  60                  65

GGG GAA ACC TTG TTG TGT AAA GGC ATT AAA GTC AAT AAC CAA ACC TTT  357
Gly Glu Thr Leu Leu Cys Lys Gly Ile Lys Val Asn Asn Gln Thr Phe
            70                  75                  80

AAT TTA GTG AGC GGA ACT TTA AGC TTT TTA GGC AAA AAA CAA ACC CCT  405
Asn Leu Val Ser Gly Thr Leu Ser Phe Leu Gly Lys Lys Gln Thr Pro
        85                  90                  95

ATG AAA GAC GTT CTT GTG GAT TTG GAT TCT TGT CAG ACG CTC CAA AAA  453
Met Lys Asp Val Leu Val Asp Leu Asp Ser Cys Gln Thr Leu Gln Lys
    100                 105                 110

GAT CCC TTA ATC CAA CCC TAATGATGAA TAATAATAAT ACCCCACCCA AACCCCTA  509
Asp Pro Leu Ile Gln Pro
115                 120

GAAGA  514
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 120 amino acids 
    (B) TYPE: amino acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Ala Val Val Val Leu Phe Leu Thr Leu Val Leu Leu Phe Leu
1               5                   10                  15

Val Leu Arg Asp Phe Gly Leu Ala Ser Pro Lys Gln Lys Ile Leu Ala
            20                  25                  30

Phe Leu Ile Val Gly Ile Ile Gly Ala Ser Ile Ser Val Tyr Thr Tyr
        35                  40                  45

Lys Gln Asn Gln Gln Asn Gln Gln Glu Ile Ala Leu Gln Arg Ala Phe
    50                  55                  60

Leu Arg Gly Glu Thr Leu Leu Cys Lys Gly Ile Lys Val Asn Asn Gln
65                  70                  75                  80

Thr Phe Asn Leu Val Ser Gly Thr Leu Ser Phe Leu Gly Lys Lys Gln
                85                  90                  95

Thr Pro Met Lys Asp Val Leu Val Asp Leu Asp Ser Cys Gln Thr Leu
            100                 105                 110

Gln Lys Asp Pro Leu Ile Gln Pro
        115                 120
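
The three-letter residues printed under each codon in the nucleotide listings follow the standard genetic code, so any stretch of a coding sequence can be re-translated and compared with the corresponding protein listing. The sketch below is a minimal illustration with hypothetical names; it builds the standard codon table and translates the first codons of the coding region of SEQ ID NO: 3, which should agree with the start of SEQ ID NO: 4.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate a coding sequence into three-letter residues, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return " ".join(residues)


# First eight codons of the coding region of SEQ ID NO: 3
print(translate("ATG GGT GCA GTG GTT GTT TTA TTT"))
```

Such a check is useful for residues that are easily misread in print, for example Ile (ATT, ATC, ATA) and Gln (CAA, CAG).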

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 1233 base pairs 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 
(ix) FEATURE: 
    (A) NAME/KEY: Coding Sequence 
    (B) LOCATION: 135...1049 
    (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTTTTTAATT TAATATTCAT TAAGCTTTTG TGGCTATTCC ATTTTAATTT TGTTTTTCAT  60
TAAAACCCAA TCTAAAATCT TATTTTTATG ATAAAATACC TAATCATAAT ATCAAATCTT  120
AAACCAACGA AACC ATG AAA AAA GCT CTC TTA CTA ACT CTC TCT CTC TCG  170
                Met Lys Lys Ala Leu Leu Leu Thr Leu Ser Leu Ser
                1               5                   10

TTC TGG CTC CAC GCT GAA AGG AAT GGA TTT TAT TTA GGT TTA AAT TTT  218
Phe Trp Leu His Ala Glu Arg Asn Gly Phe Tyr Leu Gly Leu Asn Phe
        15                  20                  25

CTA GAA GGA AGC TAT ATT AAA GGA CAA GGT AGC ATC GGC AAA AAA GCT  266
Leu Glu Gly Ser Tyr Ile Lys Gly Gln Gly Ser Ile Gly Lys Lys Ala
    30                  35                  40

TCA GCA GAA AAC GCC TTA AAT GAA GCG ATC AAT AAC GCA AAA AAT TCA  314
Ser Ala Glu Asn Ala Leu Asn Glu Ala Ile Asn Asn Ala Lys Asn Ser
45                  50                  55                  60

TTA TTC CCT AAC ACA AAA GCC ATA AGA GAT GCA CAA AAC GCC TTA AAT  362
Leu Phe Pro Asn Thr Lys Ala Ile Arg Asp Ala Gln Asn Ala Leu Asn
                65                  70                  75

GCA GTG AAA GAT TCA AAC AAA ATC GCT AGC CGA TTC GCA GGA AAT GGT  410
Ala Val Lys Asp Ser Asn Lys Ile Ala Ser Arg Phe Ala Gly Asn Gly
            80                  85                  90

GGA TCG GGC GGT CTT TTT AAT GAG CTC AGC TTT GGG TAT AAA TAT TTT  458
Gly Ser Gly Gly Leu Phe Asn Glu Leu Ser Phe Gly Tyr Lys Tyr Phe
        95                  100                 105

TTG GGT AAA AAA AGG ATT ATA GGG TTT AGG CAC TCT CTT TTT TTC GGT  506
Leu Gly Lys Lys Arg Ile Ile Gly Phe Arg His Ser Leu Phe Phe Gly
    110                 115                 120

TAC CAA CTT GGT GGC GTT GGT TCT GTT CCT GGT AGC GGT TTA ATC GTT  554
Tyr Gln Leu Gly Gly Val Gly Ser Val Pro Gly Ser Gly Leu Ile Val
125                 130                 135                 140

TTT TTA CCC TAT GGT TTC AAT ACG GAT TTG CTC ATT AAT TGG ACT AAC  602
Phe Leu Pro Tyr Gly Phe Asn Thr Asp Leu Leu Ile Asn Trp Thr Asn
                145                 150                 155

GAT AAG CGA GCG TCC CAA AAA TAT GTT GAA CGA AGG GTA AAA GGG CTC  650
Asp Lys Arg Ala Ser Gln Lys Tyr Val Glu Arg Arg Val Lys Gly Leu
            160                 165                 170

TCT ATA TTT TAC AAA GAT ATG ACC GGC AGA ACG CTA GAC GCT AAT ACA  698
Ser Ile Phe Tyr Lys Asp Met Thr Gly Arg Thr Leu Asp Ala Asn Thr
        175                 180                 185

TTA AAA AAA GCA TCA AGG CAT GTA TTT AGA AAA TCT TCA GGG CTT GTG  746
Leu Lys Lys Ala Ser Arg His Val Phe Arg Lys Ser Ser Gly Leu Val
    190                 195                 200

ATT GGC ATG GAA CTA GGG GGT AGC ACT TGG TTT GCA AGT AAC AAT CTC  794
Ile Gly Met Glu Leu Gly Gly Ser Thr Trp Phe Ala Ser Asn Asn Leu
205                 210                 215                 220

ACC CCT TTC AAT CAA GTC AAG AGT CGC ACG ATT TTT CAG TTG CAA GGA  842
Thr Pro Phe Asn Gln Val Lys Ser Arg Thr Ile Phe Gln Leu Gln Gly
                225                 230                 235

AAA TTT GGC GTT CGT TGG AAT AAT GAT GAA TAC GAT ATT GAT CGC TAT  890
Lys Phe Gly Val Arg Trp Asn Asn Asp Glu Tyr Asp Ile Asp Arg Tyr
            240                 245                 250

GGC GAT GAA ATC TAT CTT GGA GGT TCT AGT GTT GAA TTA GGG GTT AAA  938
Gly Asp Glu Ile Tyr Leu Gly Gly Ser Ser Val Glu Leu Gly Val Lys
        255                 260                 265

GTG CCA GCG TTT AAA GTC AAT TAC TAT AGC GAT GAT TAT GGG GAT AAA  986
Val Pro Ala Phe Lys Val Asn Tyr Tyr Ser Asp Asp Tyr Gly Asp Lys
    270                 275                 280

TTG GAT TAT AAA AGA GTG GTG AGC GTT TAT CTT AAC TAT ACA TAT AAC  1034
Leu Asp Tyr Lys Arg Val Val Ser Val Tyr Leu Asn Tyr Thr Tyr Asn
285                 290                 295                 300

TTT AAA AAC AAA CAT TAAAACACGC TTTTTACCGC TCTTTAGTTG GTTTTTTAAA A  1090
Phe Lys Asn Lys His
                305

AACCTTATTT TTTATTAGCT TGAAACTCTT CAAAGCCTTT TTTTCTCAAT TGGCATGCCG  1150
GGCATTTATC GCAACCATAA CCATAAGCAT GCAAAATCTT TCGCTCTCCT TGATAGCAGG  1210
TGTGCGTTTC TTTGATGACT AAA  1233

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 305 amino acids 
    (B) TYPE: amino acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Lys Lys Ala Leu Leu Leu Thr Leu Ser Leu Ser Phe Trp Leu His
1               5                   10                  15

Ala Glu Arg Asn Gly Phe Tyr Leu Gly Leu Asn Phe Leu Glu Gly Ser
            20                  25                  30

Tyr Ile Lys Gly Gln Gly Ser Ile Gly Lys Lys Ala Ser Ala Glu Asn
        35                  40                  45

Ala Leu Asn Glu Ala Ile Asn Asn Ala Lys Asn Ser Leu Phe Pro Asn
    50                  55                  60

Thr Lys Ala Ile Arg Asp Ala Gln Asn Ala Leu Asn Ala Val Lys Asp
65                  70                  75                  80

Ser Asn Lys Ile Ala Ser Arg Phe Ala Gly Asn Gly Gly Ser Gly Gly
                85                  90                  95

Leu Phe Asn Glu Leu Ser Phe Gly Tyr Lys Tyr Phe Leu Gly Lys Lys
            100                 105                 110

Arg Ile Ile Gly Phe Arg His Ser Leu Phe Phe Gly Tyr Gln Leu Gly
        115                 120                 125

Gly Val Gly Ser Val Pro Gly Ser Gly Leu Ile Val Phe Leu Pro Tyr
    130                 135                 140

Gly Phe Asn Thr Asp Leu Leu Ile Asn Trp Thr Asn Asp Lys Arg Ala
145                 150                 155                 160

Ser Gln Lys Tyr Val Glu Arg Arg Val Lys Gly Leu Ser Ile Phe Tyr
                165                 170                 175

Lys Asp Met Thr Gly Arg Thr Leu Asp Ala Asn Thr Leu Lys Lys Ala
            180                 185                 190

Ser Arg His Val Phe Arg Lys Ser Ser Gly Leu Val Ile Gly Met Glu
        195                 200                 205

Leu Gly Gly Ser Thr Trp Phe Ala Ser Asn Asn Leu Thr Pro Phe Asn
    210                 215                 220

Gln Val Lys Ser Arg Thr Ile Phe Gln Leu Gln Gly Lys Phe Gly Val
225                 230                 235                 240

Arg Trp Asn Asn Asp Glu Tyr Asp Ile Asp Arg Tyr Gly Asp Glu Ile
                245                 250                 255

Tyr Leu Gly Gly Ser Ser Val Glu Leu Gly Val Lys Val Pro Ala Phe
            260                 265                 270

Lys Val Asn Tyr Tyr Ser Asp Asp Tyr Gly Asp Lys Leu Asp Tyr Lys
        275                 280                 285

Arg Val Val Ser Val Tyr Leu Asn Tyr Thr Tyr Asn Phe Lys Asn Lys
    290                 295                 300

His
305

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 3012 base pairs 
    (B) TYPE: nucleic acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 
(ix) FEATURE: 
    (A) NAME/KEY: Coding Sequence 
    (B) LOCATION: 142...2682 
    (D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AATGACGGCT CTAAACCAAA CGATTTGACT TCTCCAAAAG AAGCCTCTCA AGAATCTCAA  60
AAAAATGAAG CTCCAAAAAA TGAAGTTCAA AGAAATGAAG CTCAAAAAGA AACCCCCCAA  120
TCCAATCAAA CGCCTAAAGA A ATG AAA GTC AAG TCC ATT TCT TAT GTC GGG  171
                        Met Lys Val Lys Ser Ile Ser Tyr Val Gly
                        1               5                   10

CTT TCT TAC ATG TCT GAC ATG CTC GCT AAT GAA ATT GTA AAG ATT CGT  219
Leu Ser Tyr Met Ser Asp Met Leu Ala Asn Glu Ile Val Lys Ile Arg
                15                  20                  25

GTG GGC GAT ATT GTG GAT TCT AAA AAA ATA GAC ACC GCT GTT TTG GCT  267
Val Gly Asp Ile Val Asp Ser Lys Lys Ile Asp Thr Ala Val Leu Ala
            30                  35                  40

TTG TTC AAT CAA GGG TAT TTT AAA GAC GTT TAT GCC ACT TTT GAA GGC  315
Leu Phe Asn Gln Gly Tyr Phe Lys Asp Val Tyr Ala Thr Phe Glu Gly
        45                  50                  55

GGC ATA TTA GAG TTT CAT TTT GAT GAA AAA GCC AGG ATT GCC GGG GTA  363
Gly Ile Leu Glu Phe His Phe Asp Glu Lys Ala Arg Ile Ala Gly Val
    60                  65                  70

GAA ATC AAG GGT TAT GGG ACT GAA AAG GAA AAA GAC GGC TTA AAA TCC  411
Glu Ile Lys Gly Tyr Gly Thr Glu Lys Glu Lys Asp Gly Leu Lys Ser
75                  80                  85                  90

CAA ATG GGG ATC AAA AAG GGC GAC ACC TTT GAT GAG CAA AAA TTA GAG  459
Gln Met Gly Ile Lys Lys Gly Asp Thr Phe Asp Glu Gln Lys Leu Glu
                95                  100                 105

CAT GCT AAA ACG GCT TTA AAA ACC GCT TTA GAG GGG CAG GGC TAT TAT  507
His Ala Lys Thr Ala Leu Lys Thr Ala Leu Glu Gly Gln Gly Tyr Tyr
            110                 115                 120

GGG AGC GTG GTG GAG GTG CGC ACA GAA AAG GTC AGT GAG GGT GCA TTA  555
Gly Ser Val Val Glu Val Arg Thr Glu Lys Val Ser Glu Gly Ala Leu
        125                 130                 135

TTG ATC GTG TTT GAT GTG AAT AGG GGG GAT AGC ATT TAT ATC AAA CAA  603
Leu Ile Val Phe Asp Val Asn Arg Gly Asp Ser Ile Tyr Ile Lys Gln
    140                 145                 150

TCC ATT TAT GAG GGA AGC GCG AAA TTA AAA CGC CGC ATG ATT GAA TCT  651
Ser Ile Tyr Glu Gly Ser Ala Lys Leu Lys Arg Arg Met Ile Glu Ser
155                 160                 165                 170

TTG AGT GCG AAC AAG CAA CGA GAT TTC ATG GGC TGG ATG TGG GGC TTG  699
Leu Ser Ala Asn Lys Gln Arg Asp Phe Met Gly Trp Met Trp Gly Leu
                175                 180                 185

AAT GAC GGG AAA TTG CGT TTA GAT CAA CTA GAA TAC GAT TCT ATG CGT  747
Asn Asp Gly Lys Leu Arg Leu Asp Gln Leu Glu Tyr Asp Ser Met Arg
            190                 195                 200

ATC CAA GAT GTG TAT ATG CGT AGG GGT TAC TTA GAC GCT CAT ATT TCT  795
Ile Gln Asp Val Tyr Met Arg Arg Gly Tyr Leu Asp Ala His Ile Ser
        205                 210                 215

TCG CCT TTT TTG AAA ACG GAT TTT TCT ACC CAT GAC GCT AAG CTT CAT  843
Ser Pro Phe Leu Lys Thr Asp Phe Ser Thr His Asp Ala Lys Leu His
    220                 225                 230

TAT AAA GTC AAA GAG GGG ATC CAA TAC AGG ATT TCA GAC ATT TTA ATA  891
Tyr Lys Val Lys Glu Gly Ile Gln Tyr Arg Ile Ser Asp Ile Leu Ile
235                 240                 245                 250

GAG ATT GAC AAC CCG GTA GTC CCC TTA AAA ACC TTA GAA AAA GCG CTT  939
Glu Ile Asp Asn Pro Val Val Pro Leu Lys Thr Leu Glu Lys Ala Leu
                255                 260                 265

AAA GTG AAA AGG AAA GAT GTC TTT AAT ATT GAG CAT TTA AGA GCG GAT  987
Lys Val Lys Arg Lys Asp Val Phe Asn Ile Glu His Leu Arg Ala Asp
            270                 275                 280

GCG CAA ATT TTA AAA ACC GAA ATC GCC GAT AAG GGT TAT GCG TTT GCG  1035
Ala Gln Ile Leu Lys Thr Glu Ile Ala Asp Lys Gly Tyr Ala Phe Ala
        285                 290                 295

GTG GTG AAG CCA GAC TTG GAT AAA GAT GAA AAA AAC GGG CTT GTG AAA  1083
Val Val Lys Pro Asp Leu Asp Lys Asp Glu Lys Asn Gly Leu Val Lys
    300                 305                 310

GTC ATT TAT CGT ATT GAA GTG GGC GAT ATG GTG TAT ATC AAT GAT GTC  1131
Val Ile Tyr Arg Ile Glu Val Gly Asp Met Val Tyr Ile Asn Asp Val
315                 320                 325                 330

ATC ATT TCA GGG AAC CAG CGC ACG AGC GAT AGG ATC ATT AGA AGG GAG  1179
Ile Ile Ser Gly Asn Gln Arg Thr Ser Asp Arg Ile Ile Arg Arg Glu
                335                 340                 345

TTA TTG TTA GGG CCT AAG GAT AAA TAC AAC TTG ACC AAA CTG AGA AAT  1227
Leu Leu Leu Gly Pro Lys Asp Lys Tyr Asn Leu Thr Lys Leu Arg Asn
            350                 355                 360

TCC GAA AAT TCT TTA AGG CGT TTA GGA TTC TTC TCT AAA GTC AAA ATT  1275
Ser Glu Asn Ser Leu Arg Arg Leu Gly Phe Phe Ser Lys Val Lys Ile
        365                 370                 375

GAA GAA AAA AGG GTT AAT AGC TCA CTC ATG GAT TTA TTA GTG AGC GTA  1323
Glu Glu Lys Arg Val Asn Ser Ser Leu Met Asp Leu Leu Val Ser Val
    380                 385                 390

GAA GAG GGG CGT ACT GGG CAG TTG CAA TTT GGG TTA GGC TAT GGC TCT  1371
Glu Glu Gly Arg Thr Gly Gln Leu Gln Phe Gly Leu Gly Tyr Gly Ser
395                 400                 405                 410

TAT GGA GGG CTT ATG CTT AAT GGG AGC GTG AGC GAA AGA AAC CTT TTT  1419
Tyr Gly Gly Leu Met Leu Asn Gly Ser Val Ser Glu Arg Asn Leu Phe
                415                 420                 425

GGC ACA GGG CAA AGC ATG AGC TTG TAT GCT AAC ATC GCT ACA GGG GGG  1467
Gly Thr Gly Gln Ser Met Ser Leu Tyr Ala Asn Ile Ala Thr Gly Gly
            430                 435                 440

GGT AGA TCT TAT CCG GGC ATG CCA AAA GGA GCG GGG CGT ATG TTT GCC  1515
Gly Arg Ser Tyr Pro Gly Met Pro Lys Gly Ala Gly Arg Met Phe Ala
        445                 450                 455

GGG AAT TTG AGC TTG ACT AAT CCA AGG ATT TTT GAC AGC TGG TAT AGC  1563
Gly Asn Leu Ser Leu Thr Asn Pro Arg Ile Phe Asp Ser Trp Tyr Ser
    460                 465                 470

TCT ACG ATC AAC CTT TAT GCG GAT TAC AGG ATA AGC TAC CAA TAC ATC  1611
Ser Thr Ile Asn Leu Tyr Ala Asp Tyr Arg Ile Ser Tyr Gln Tyr Ile
475                 480                 485                 490

CAA CAA GGC GGG GGC TTT GGG GTG AAT GTC GGG CGC ATG CTG GGT AAT  1659
Gln Gln Gly Gly Gly Phe Gly Val Asn Val Gly Arg Met Leu Gly Asn
                495                 500                 505

AGA ACC CAT GTG AGC TTA GGG TAT AAC TTG AAT GTT ACC AAA CTC CTT  1707
Arg Thr His Val Ser Leu Gly Tyr Asn Leu Asn Val Thr Lys Leu Leu
            510                 515                 520

GGT TTC AGC AGC CCT TTA TAC AAC CGC TAC TAT TCC TCT GTT AAT GAA  1755
Gly Phe Ser Ser Pro Leu Tyr Asn Arg Tyr Tyr Ser Ser Val Asn Glu
        525                 530                 535

GTG GTT TCT CCA AGG CAA TGT TCT ACC CCC GCA TCG GTG ATT ATC AAT  1803
Val Val Ser Pro Arg Gln Cys Ser Thr Pro Ala Ser Val Ile Ile Asn
    540                 545                 550

CGC TTA TCA GGC GGT AAA ACC CCC TTA CAA CCT GAA AGC TGT TCT AGT  1851
Arg Leu Ser Gly Gly Lys Thr Pro Leu Gln Pro Glu Ser Cys Ser Ser
555                 560                 565                 570

CCT GGA GCG ATC ACC ACT TCA CCA GAA ATA AGA GGT ATT TGG GAT AGG  1899
Pro Gly Ala Ile Thr Thr Ser Pro Glu Ile Arg Gly Ile Trp Asp Arg
                575                 580                 585

GAT TAC CAT ACG CCT ATC ACC AGC TCT TTC ACC CTT GAT GTG AGC TAT  1947
Asp Tyr His Thr Pro Ile Thr Ser Ser Phe Thr Leu Asp Val Ser Tyr
            590                 595                 600

GAC AAC ACC GAT GAT TAT TAC TTC CCT AGA AAT GGG GTT ATC TTT AGT  1995
Asp Asn Thr Asp Asp Tyr Tyr Phe Pro Arg Asn Gly Val Ile Phe Ser
        605                 610                 615

TCC TAT GCG ACG ATG TCT GGC TTG CCA AGC TCT GGC ACG CTC AAT TCT  2043
Ser Tyr Ala Thr Met Ser Gly Leu Pro Ser Ser Gly Thr Leu Asn Ser
    620                 625                 630

TGG AAC GGG TTA GGC GGG AAT GTC CGT AAC ACC AAA GTT TAT GGT AAA  2091
Trp Asn Gly Leu Gly Gly Asn Val Arg Asn Thr Lys Val Tyr Gly Lys
635                 640                 645                 650

TTC GCC GCT TAC CAC CAT TTG CAA AAA TAT TTA TTG ATA GAT TTG ATC  2139
Phe Ala Ala Tyr His His Leu Gln Lys Tyr Leu Leu Ile Asp Leu Ile
                655                 660                 665

GCT CGC TTT AAA ACG CAA GGA GGT TAT ATC TTT AGG TAT AAC ACC GAT  2187
Ala Arg Phe Lys Thr Gln Gly Gly Tyr Ile Phe Arg Tyr Asn Thr Asp
            670                 675                 680

GAT TAC TTG CCC TTA AAC TCC ACC TTC TAC ATG GGG GGC GTA ACC ACG  2235
Asp Tyr Leu Pro Leu Asn Ser Thr Phe Tyr Met Gly Gly Val Thr Thr
        685                 690                 695

GTG AGA GGC TTT AGG AAC GGA TCG GTT ACT CCT AAA GAT GAG TTT GGC  2283
Val Arg Gly Phe Arg Asn Gly Ser Val Thr Pro Lys Asp Glu Phe Gly
    700                 705                 710

TTG TGG CTT GGA GGC GAT GGG ATT TTT ACC GCT TCT ACT GAA TTG AGC  2331
Leu Trp Leu Gly Gly Asp Gly Ile Phe Thr Ala Ser Thr Glu Leu Ser
715                 720                 725                 730

TAT GGG GTG CTA AAG GCG GCT AAA ATG CGC TTA GCG TGG TTT TTT GAC  2379
Tyr Gly Val Leu Lys Ala Ala Lys Met Arg Leu Ala Trp Phe Phe Asp
                735                 740                 745

TTT GGT TTC TTA ACC TTT AAA ACC CCA ACT AGA GGG AGT TTT TTC TAT  2427
Phe Gly Phe Leu Thr Phe Lys Thr Pro Thr Arg Gly Ser Phe Phe Tyr
            750                 755                 760

AAC GCT CCT GTT ACG ACA GCG AAT TTT AAA GAT TAT GGC GTT ATA GGG  2475
Asn Ala Pro Val Thr Thr Ala Asn Phe Lys Asp Tyr Gly Val Ile Gly
        765                 770                 775

GCT GGG TTT GAA AGA GCG ACT TGG AGG GCT TCC ACA GGC TTG CAG ATT  2523
Ala Gly Phe Glu Arg Ala Thr Trp Arg Ala Ser Thr Gly Leu Gln Ile
    780                 785                 790

GAA TGG ATT TCG CCC ATG GGG CCT TTG GTG TTG ATT TTC CCT ATA GCG  2571
Glu Trp Ile Ser Pro Met Gly Pro Leu Val Leu Ile Phe Pro Ile Ala
795                 800                 805                 810

TTT TTC AAC CAA TGG GGC GAT GGC AAT GGC AAG AAA TGT AAA GGG CTA  2619
Phe Phe Asn Gln Trp Gly Asp Gly Asn Gly Lys Lys Cys Lys Gly Leu
                815                 820                 825

TGC TTC AAC CCT AAC ATG GAC GAT TAC ACG CAA CAC TTT GAA TTT TCT  2667
Cys Phe Asn Pro Asn Met Asp Asp Tyr Thr Gln His Phe Glu Phe Ser
            830                 835                 840

ATG GGA ACA AGG TTT TAAAATGCGC ATCAACAGAG AAGAAATTTT GGATTTAATG A  2723
Met Gly Thr Arg Phe
        845

AAAACGCGCC CTTGAAAGAA TTGGGGCAAA GGGCTTTGAG GGTGAAGCAA CGCTTGCACC  2783
CTGAAAACTT GACGACTTTT ATTGTGGATA GGAATATCAA TTACACCAAT ATTTGTTTTG  2843
TGGATTGCAA GTTTTGCGCG TTCAAACGCA CCTTAAAAGA AAAAGACGCC TATGTGTTGA  2903
GCTATGAAGA AATTGATCAA AAGATTGAAG AATTGCTCGC TATTGGCGGC ACGCAGATCC  2963
TTTTTCAAGG GGGGGTGCAC CCGCAGCTAA AGATTGATTA TTATGAGAA  3012

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
    (A) LENGTH: 847 amino acids 
    (B) TYPE: amino acid 
    (C) STRANDEDNESS: single 
    (D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Lys Val Lys Ser Ile Ser Tyr Val Gly Leu Ser Tyr Met Ser Asp
1               5                   10                  15

Met Leu Ala Asn Glu Ile Val Lys Ile Arg Val Gly Asp Ile Val Asp
            20                  25                  30

Ser Lys Lys Ile Asp Thr Ala Val Leu Ala Leu Phe Asn Gln Gly Tyr
        35                  40                  45

Phe Lys Asp Val Tyr Ala Thr Phe Glu Gly Gly Ile Leu Glu Phe His
    50                  55                  60

Phe Asp Glu Lys Ala Arg Ile Ala Gly Val Glu Ile Lys Gly Tyr Gly
65                  70                  75                  80

Thr Glu Lys Glu Lys Asp Gly Leu Lys Ser Gln Met Gly Ile Lys Lys
                85                  90                  95

Gly Asp Thr Phe Asp Glu Gln Lys Leu Glu His Ala Lys Thr Ala Leu
            100                 105                 110

Lys Thr Ala Leu Glu Gly Gln Gly Tyr Tyr Gly Ser Val Val Glu Val
        115                 120                 125

Arg Thr Glu Lys Val Ser Glu Gly Ala Leu Leu Ile Val Phe Asp Val
    130                 135                 140

Asn Arg Gly Asp Ser Ile Tyr Ile Lys Gln Ser Ile Tyr Glu Gly Ser
145                 150                 155                 160

Ala Lys Leu Lys Arg Arg Met Ile Glu Ser Leu Ser Ala Asn Lys Gln
                165                 170                 175

Arg Asp Phe Met Gly Trp Met Trp Gly Leu Asn Asp Gly Lys Leu Arg
            180                 185                 190

Leu Asp Gln Leu Glu Tyr Asp Ser Met Arg Ile Gln Asp Val Tyr Met
        195                 200                 205

Arg Arg Gly Tyr Leu Asp Ala His Ile Ser Ser Pro Phe Leu Lys Thr
    210                 215                 220

Asp Phe Ser Thr His Asp Ala Lys Leu His Tyr Lys Val Lys Glu Gly
225                 230                 235                 240

Ile Gln Tyr Arg Ile Ser Asp Ile Leu Ile Glu Ile Asp Asn Pro Val
                245                 250                 255

Val Pro Leu Lys Thr Leu Glu Lys Ala Leu Lys Val Lys Arg Lys Asp
            260                 265                 270

Val Phe Asn Ile Glu His Leu Arg Ala Asp Ala Gln Ile Leu Lys Thr
        275                 280                 285

Glu Ile Ala Asp Lys Gly Tyr Ala Phe Ala Val Val Lys Pro Asp Leu
    290                 295                 300

Asp Lys Asp Glu Lys Asn Gly Leu Val Lys Val Ile Tyr Arg Ile Glu
305                 310                 315                 320

Val Gly Asp Met Val Tyr Ile Asn Asp Val Ile Ile Ser Gly Asn Gln
                325                 330                 335

Arg Thr Ser Asp Arg Ile Ile Arg Arg Glu Leu Leu Leu Gly Pro Lys
            340                 345                 350

Asp Lys Tyr Asn Leu Thr Lys Leu Arg Asn Ser Glu Asn Ser Leu Arg
        355                 360                 365

Arg Leu Gly Phe Phe Ser Lys Val Lys Ile Glu Glu Lys Arg Val Asn
    370                 375                 380

Ser Ser Leu Met Asp Leu Leu Val Ser Val Glu Glu Gly Arg Thr Gly
385                 390                 395                 400

Gln Leu Gln Phe Gly Leu Gly Tyr Gly Ser Tyr Gly Gly Leu Met Leu
                405                 410                 415

Asn Gly Ser Val Ser Glu Arg Asn Leu Phe Gly Thr Gly Gln Ser Met
            420                 425                 430

Ser Leu Tyr Ala Asn Ile Ala Thr Gly Gly Gly Arg Ser Tyr Pro Gly
        435                 440                 445

Met Pro Lys Gly Ala Gly Arg Met Phe Ala Gly Asn Leu Ser Leu Thr
    450                 455                 460

Asn Pro Arg Ile Phe Asp Ser Trp Tyr Ser Ser Thr Ile Asn Leu Tyr
465                 470                 475                 480

Ala Asp Tyr Arg Ile Ser Tyr Gln Tyr Ile Gln Gln Gly Gly Gly Phe
485 490 495

Gly Val Asn Val Gly Arg Met Leu Gly Asn Arg Thr His Val Ser Leu
500 505 510

Gly Tyr Asn Leu Asn Val Thr Lys Leu Leu Gly Phe Ser Ser Pro Leu
515 520 525

Tyr Asn Arg Tyr Tyr Ser Ser Val Asn Glu Val Val Ser Pro Arg Gln
530 535 540

Cys Ser Thr Pro Ala Ser Val Ile Ile Asn Arg Leu Ser Gly Gly Lys
545 550 555 560

Thr Pro Leu Gln Pro Glu Ser Cys Ser Ser Pro Gly Ala Ile Thr Thr
565 570 575

Ser Pro Glu Ile Arg Gly Ile Trp Asp Arg Asp Tyr His Thr Pro Ile
580 585 590

Thr Ser Ser Phe Thr Leu Asp Val Ser Tyr Asp Asn Thr Asp Asp Tyr
595 600 605

Tyr Phe Pro Arg Asn Gly Val Ile Phe Ser Ser Tyr Ala Thr Met Ser
610 615 620

Gly Leu Pro Ser Ser Gly Thr Leu Asn Ser Trp Asn Gly Leu Gly Gly
625 630 635 640

Asn Val Arg Asn Thr Lys Val Tyr Gly Lys Phe Ala Ala Tyr His His
645 650 655

Leu Gln Lys Tyr Leu Leu Ile Asp Leu Ile Ala Arg Phe Lys Thr Gln
660 665 670

Gly Gly Tyr Ile Phe Arg Tyr Asn Thr Asp Asp Tyr Leu Pro Leu Asn
675 680 685

Ser Thr Phe Tyr Met Gly Gly Val Thr Thr Val Arg Gly Phe Arg Asn
690 695 700

Gly Ser Val Thr Pro Lys Asp Glu Phe Gly Leu Trp Leu Gly Gly Asp
705 710 715 720

Gly Ile Phe Thr Ala Ser Thr Glu Leu Ser Tyr Gly Val Leu Lys Ala
725 730 735

Ala Lys Met Arg Leu Ala Trp Phe Phe Asp Phe Gly Phe Leu Thr Phe
740 745 750

Lys Thr Pro Thr Arg Gly Ser Phe Phe Tyr Asn Ala Pro Val Thr Thr
755 760 765

Ala Asn Phe Lys Asp Tyr Gly Val Ile Gly Ala Gly Phe Glu Arg Ala
770 775 780

Thr Trp Arg Ala Ser Thr Gly Leu Gln Ile Glu Trp Ile Ser Pro Met
785 790 795 800

Gly Pro Leu Val Leu Ile Phe Pro Ile Ala Phe Phe Asn Gln Trp Gly
805 810 815

Asp Gly Asn Gly Lys Lys Cys Lys Gly Leu Cys Phe Asn Pro Asn Met
820 825 830

Asp Asp Tyr Thr Gln His Phe Glu Phe Ser Met Gly Thr Arg Phe
835 840 845

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1032 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 149...913
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

ATGTTTTGTG TTGCAAAAAC AAAACAGACC AATAAAGGCA TCACTTTTAA AAGCGTTGTT 60
TAGGGGGGTT TGGTTATTGG TGTTTGATTA GAATAGGGTT GTTTTTAATT TTCTTTTAAG 120

AGGAGTTTTT ACTTTTTTAA GGGTTTTT ATG GAT ATT TAT GCG TTA TAT ATA 172
Met Asp Ile Tyr Ala Leu Tyr Ile
1 5

GCG ATA GGG CTT TTT ACT GGC ATT CTA TCA GGG ATT TTT GGC ATT GGT 220
Ala Ile Gly Leu Phe Thr Gly Ile Leu Ser Gly Ile Phe Gly Ile Gly
10 15 20

GGG GGG TTG ATC ATT GTC CCT ATC ATG CTC GCA ACC GGG CAT TCT TTT 268
Gly Gly Leu Ile Ile Val Pro Ile Met Leu Ala Thr Gly His Ser Phe
25 30 35 40

GAA GAA TCC ATT GGG ATT TCC ATT TTG CAA ATG GCG CTT TCA TCG TTC 316
Glu Glu Ser Ile Gly Ile Ser Ile Leu Gln Met Ala Leu Ser Ser Phe
45 50 55

GTG GGC TCT GTT TTG AAT TTC AAA AAA AAA TCG CTT GAT TTT TCT TTA 364
Val Gly Ser Val Leu Asn Phe Lys Lys Lys Ser Leu Asp Phe Ser Leu
60 65 70

GGC TTG TTG ATA GGG GCA GGG GGG CTG ATA GGG GCG AGT TTT AGC GGA 412
Gly Leu Leu Ile Gly Ala Gly Gly Leu Ile Gly Ala Ser Phe Ser Gly
75 80 85

TTT GTT TTA AAA ATC GTT TCC AGT AAA ATT TTA ATG GTT ATT TTC GCG 460
Phe Val Leu Lys Ile Val Ser Ser Lys Ile Leu Met Val Ile Phe Ala
90 95 100

CTT TTA GTC GTG TAT TCT ATG ATC CAA TTT GTT TTG AAA CCC AAA AAA 508
Leu Leu Val Val Tyr Ser Met Ile Gln Phe Val Leu Lys Pro Lys Lys
105 110 115 120

AAA GAT TTG ATA GCG GAT ACT AAA CGC TAT CAT CTG CAA GGT TTG AAA 556
Lys Asp Leu Ile Ala Asp Thr Lys Arg Tyr His Leu Gln Gly Leu Lys
125 130 135

TTA TTT TTA ATT GGC ACG CTC ACA GGG TTT TTT GCT ATC ACT TTA GGG 604
Leu Phe Leu Ile Gly Thr Leu Thr Gly Phe Phe Ala Ile Thr Leu Gly
140 145 150

ATT GGT GGG GGG ATG CTC ATG GTG CCT TTG ATG CAT TAT TTT TTA GGG 652
Ile Gly Gly Gly Met Leu Met Val Pro Leu Met His Tyr Phe Leu Gly
155 160 165

TAT GAT TCT AAA AAA TGC GTG GCT CTA GGG TTA TTT TTC ATC TTG TTT 700
Tyr Asp Ser Lys Lys Cys Val Ala Leu Gly Leu Phe Phe Ile Leu Phe
170 175 180

TCT TCT ATT TCA GGA GCT TTT TCT TTA ATG TAT CAC CAC ATC ATC AAT 748
Ser Ser Ile Ser Gly Ala Phe Ser Leu Met Tyr His His Ile Ile Asn
185 190 195 200

AAA GAA GTG CTC TTA GCA GGG GCG ATT GTG GGA TTA GGA TCT GTT ATG 796
Lys Glu Val Leu Leu Ala Gly Ala Ile Val Gly Leu Gly Ser Val Met
205 210 215

GGC GTG AGC ATT GGG ATT AAA TGG ATC ATG GGG CTT TTG AAT GAA AAA 844
Gly Val Ser Ile Gly Ile Lys Trp Ile Met Gly Leu Leu Asn Glu Lys
220 225 230

ATG CAT AAA GCT TTG ATT TTA GGG GTG TAT GGT TTG TCG CTA TTG ATT 892
Met His Lys Ala Leu Ile Leu Gly Val Tyr Gly Leu Ser Leu Leu Ile
235 240 245

GTT TTA TAC AAA CTC TTT TTT TAATTGATGG TTTTATACCA CTACTATTTT AAGA 947
Val Leu Tyr Lys Leu Phe Phe
250 255

CCCCTAAGAG TTTCCCTTTA GAGTATTTGC ATTTGTGCGC TAATGAGAGC CATTTATTGA 1007
GATTGGATTT TGATGCGGCC AATTT 1032

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 255 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Asp Ile Tyr Ala Leu Tyr Ile Ala Ile Gly Leu Phe Thr Gly Ile
1 5 10 15

Leu Ser Gly Ile Phe Gly Ile Gly Gly Gly Leu Ile Ile Val Pro Ile
20 25 30

Met Leu Ala Thr Gly His Ser Phe Glu Glu Ser Ile Gly Ile Ser Ile
35 40 45

Leu Gln Met Ala Leu Ser Ser Phe Val Gly Ser Val Leu Asn Phe Lys
50 55 60

Lys Lys Ser Leu Asp Phe Ser Leu Gly Leu Leu Ile Gly Ala Gly Gly
65 70 75 80

Leu Ile Gly Ala Ser Phe Ser Gly Phe Val Leu Lys Ile Val Ser Ser
85 90 95

Lys Ile Leu Met Val Ile Phe Ala Leu Leu Val Val Tyr Ser Met Ile
100 105 110

Gln Phe Val Leu Lys Pro Lys Lys Lys Asp Leu Ile Ala Asp Thr Lys
115 120 125

Arg Tyr His Leu Gln Gly Leu Lys Leu Phe Leu Ile Gly Thr Leu Thr
130 135 140

Gly Phe Phe Ala Ile Thr Leu Gly Ile Gly Gly Gly Met Leu Met Val
145 150 155 160

Pro Leu Met His Tyr Phe Leu Gly Tyr Asp Ser Lys Lys Cys Val Ala
165 170 175

Leu Gly Leu Phe Phe Ile Leu Phe Ser Ser Ile Ser Gly Ala Phe Ser
180 185 190

Leu Met Tyr His His Ile Ile Asn Lys Glu Val Leu Leu Ala Gly Ala
195 200 205

Ile Val Gly Leu Gly Ser Val Met Gly Val Ser Ile Gly Ile Lys Trp
210 215 220

Ile Met Gly Leu Leu Asn Glu Lys Met His Lys Ala Leu Ile Leu Gly
225 230 235 240

Val Tyr Gly Leu Ser Leu Leu Ile Val Leu Tyr Lys Leu Phe Phe
245 250 255

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1057 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 66...980
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

AAGAGCATGC GAGAGAGCAT AGAGGAATTT TTTAATCAAG AAATGTTGCA AAGTGAAGTG 60

CCGTT ATG GGT AGA ATT GAA TCA AAA AAG CGT TTG AAA GCG CTT GTT TTT 110
Met Gly Arg Ile Glu Ser Lys Lys Arg Leu Lys Ala Leu Val Phe
1 5 10 15

TTA GCC AGC TTG GGG GTT TTG TGG GGC AAT AGC GCT GAA AAA ACG CCT 158
Leu Ala Ser Leu Gly Val Leu Trp Gly Asn Ser Ala Glu Lys Thr Pro
20 25 30

TTT TTT AAA ACG AAA AAC CAC ATT TAT CTA GGT TTT AGG CTA GGC ACA 206
Phe Phe Lys Thr Lys Asn His Ile Tyr Leu Gly Phe Arg Leu Gly Thr
35 40 45

GGA GCC AAT GTG CAC ACG AGC ATG TGG CAA CAA GCC TAT AAA GAC AAC 254
Gly Ala Asn Val His Thr Ser Met Trp Gln Gln Ala Tyr Lys Asp Asn
50 55 60

CCC ACC TGC CCT GGT AGC GTG TGT TAT GGC GAG AAA TTA GAA GCC CAT 302
Pro Thr Cys Pro Gly Ser Val Cys Tyr Gly Glu Lys Leu Glu Ala His
65 70 75

TAT CAG GGG GGT AAA AAC CTG TCT TAT ACC GGG CAA ATA GGC GAT GAA 350
Tyr Gln Gly Gly Lys Asn Leu Ser Tyr Thr Gly Gln Ile Gly Asp Glu
80 85 90 95

ATA GCT TTT GAT AAA CAC CAT ATT TTA GGC TTA AGG GTG TGG GGG GAT 398
Ile Ala Phe Asp Lys His His Ile Leu Gly Leu Arg Val Trp Gly Asp
100 105 110

GTA GAA TAC GCT AAA GCG CAA TTA GGT CAA AAA GTG GGG GGT AAT ACC 446
Val Glu Tyr Ala Lys Ala Gln Leu Gly Gln Lys Val Gly Gly Asn Thr
115 120 125

CTT TTA TCC CAA GCC AAT TAT GAC CCA AAC GCG ATT AAA ACC TAC GAT 494
Leu Leu Ser Gln Ala Asn Tyr Asp Pro Asn Ala Ile Lys Thr Tyr Asp
130 135 140

TCT GCT TCA AAC ACT CAA GGC CCT TTA GTT TTG CAA AAA ACC CCA AGC 542
Ser Ala Ser Asn Thr Gln Gly Pro Leu Val Leu Gln Lys Thr Pro Ser
145 150 155

CCT CAA AAC TTC CTT TTC AAT AAC GGG CAT TTC ATG GCG TTT GGT TTG 590
Pro Gln Asn Phe Leu Phe Asn Asn Gly His Phe Met Ala Phe Gly Leu
160 165 170 175

AAC GTG AAT GTG TTT GTT AAC CTC CCT ATA GAC ACC CTT TTA AAA CTC 638
Asn Val Asn Val Phe Val Asn Leu Pro Ile Asp Thr Leu Leu Lys Leu
180 185 190

GCT TTA AAA ACA GAA AAA ATG CTG TTT TTT AAA ATA GGC GTG TTT GGT 686
Ala Leu Lys Thr Glu Lys Met Leu Phe Phe Lys Ile Gly Val Phe Gly
195 200 205

GGG GGC GGG GTG GAA TAC GCA ATA TTA TGG AGT CCT AAC TAT CAA AAT 734
Gly Gly Gly Val Glu Tyr Ala Ile Leu Trp Ser Pro Asn Tyr Gln Asn
210 215 220

CAA AAC ACG AAA CAA GGC GAT AAA TTT TTT GCA GCG GGT GGG GGG TTT 782
Gln Asn Thr Lys Gln Gly Asp Lys Phe Phe Ala Ala Gly Gly Gly Phe
225 230 235

TTT GTG AAT TTT GGG GGT TCT TTG TAT ATA GGC AAA CGC AAC CGC TTC 830
Phe Val Asn Phe Gly Gly Ser Leu Tyr Ile Gly Lys Arg Asn Arg Phe
240 245 250 255

AAT GTG GGG TTA AAA ATC CCT TAC TAT AGC TTG AGC GCG CAA AGT TGG 878
Asn Val Gly Leu Lys Ile Pro Tyr Tyr Ser Leu Ser Ala Gln Ser Trp
260 265 270

AAA AAC TTT GGC TCT AGC AAT GTG TGG CAG CAA CAA ACG ATC CGA CAA 926
Lys Asn Phe Gly Ser Ser Asn Val Trp Gln Gln Gln Thr Ile Arg Gln
275 280 285

AAC TTC AGC GTT TTT AGG AAT AAA GAA GTT TTT GTC AGC TAC GCG TTC 974
Asn Phe Ser Val Phe Arg Asn Lys Glu Val Phe Val Ser Tyr Ala Phe
290 295 300

TTG TTT TAGTTTGGAT TCGTTCTCAT TAAACACTGA TGATAAAATT CAAAAGATGG TT 1032
Leu Phe
305

TTATCGTTAC AAAATTCAAC ATTTC 1057

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 305 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Gly Arg Ile Glu Ser Lys Lys Arg Leu Lys Ala Leu Val Phe Leu
1 5 10 15

Ala Ser Leu Gly Val Leu Trp Gly Asn Ser Ala Glu Lys Thr Pro Phe
20 25 30

Phe Lys Thr Lys Asn His Ile Tyr Leu Gly Phe Arg Leu Gly Thr Gly
35 40 45

Ala Asn Val His Thr Ser Met Trp Gln Gln Ala Tyr Lys Asp Asn Pro
50 55 60

Thr Cys Pro Gly Ser Val Cys Tyr Gly Glu Lys Leu Glu Ala His Tyr
65 70 75 80

Gln Gly Gly Lys Asn Leu Ser Tyr Thr Gly Gln Ile Gly Asp Glu Ile
85 90 95

Ala Phe Asp Lys His His Ile Leu Gly Leu Arg Val Trp Gly Asp Val
100 105 110

Glu Tyr Ala Lys Ala Gln Leu Gly Gln Lys Val Gly Gly Asn Thr Leu
115 120 125

Leu Ser Gln Ala Asn Tyr Asp Pro Asn Ala Ile Lys Thr Tyr Asp Ser
130 135 140

Ala Ser Asn Thr Gln Gly Pro Leu Val Leu Gln Lys Thr Pro Ser Pro
145 150 155 160

Gln Asn Phe Leu Phe Asn Asn Gly His Phe Met Ala Phe Gly Leu Asn
165 170 175

Val Asn Val Phe Val Asn Leu Pro Ile Asp Thr Leu Leu Lys Leu Ala
180 185 190

Leu Lys Thr Glu Lys Met Leu Phe Phe Lys Ile Gly Val Phe Gly Gly
195 200 205

Gly Gly Val Glu Tyr Ala Ile Leu Trp Ser Pro Asn Tyr Gln Asn Gln
210 215 220

Asn Thr Lys Gln Gly Asp Lys Phe Phe Ala Ala Gly Gly Gly Phe Phe
225 230 235 240

Val Asn Phe Gly Gly Ser Leu Tyr Ile Gly Lys Arg Asn Arg Phe Asn
245 250 255

Val Gly Leu Lys Ile Pro Tyr Tyr Ser Leu Ser Ala Gln Ser Trp Lys
260 265 270

Asn Phe Gly Ser Ser Asn Val Trp Gln Gln Gln Thr Ile Arg Gln Asn
275 280 285




Phe Ser Val Phe Arg Asn Lys Glu Val Phe Val Ser Tyr Ala Phe Leu 
290 295 300 

Phe 
305 

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 624 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 77...535
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

TATTAGTTGG TTTAATACGC TATAATCTGT GTGCCAACAT TGTGTGGCTC AAATCATTTT 60

TAAAAGGGGT TTTATA ATG GAA AAC AAC GAA AAT CAT GAG AAA TTG AAT GGC 112
Met Glu Asn Asn Glu Asn His Glu Lys Leu Asn Gly
1 5 10

GTT TTG CGC AAG TTT TTA GGC GAT GCG TTC ACG CTT GAT GGG AAA GAA 160
Val Leu Arg Lys Phe Leu Gly Asp Ala Phe Thr Leu Asp Gly Lys Glu
15 20 25

GGA GGA TTG AAT ATG GAA AAA TTG CGC GAA GCC ATT AAA AAA GAA AAA 208
Gly Gly Leu Asn Met Glu Lys Leu Arg Glu Ala Ile Lys Lys Glu Lys
30 35 40

CCA ATC ATG AAT ATT TTG CTC ATG GGA GCT ACT GGG GTG GGT AAA AGC 256
Pro Ile Met Asn Ile Leu Leu Met Gly Ala Thr Gly Val Gly Lys Ser
45 50 55 60

TCG CTC ATT AAC GCT CTA TTC GGT AAG GAA GTA GCT AAA GCA GGT GTA 304
Ser Leu Ile Asn Ala Leu Phe Gly Lys Glu Val Ala Lys Ala Gly Val
65 70 75

GGA AAA CCC ATC ACT CAG CAT CTT GAA AAA TAT GTT GAT GAA GAA AAA 352
Gly Lys Pro Ile Thr Gln His Leu Glu Lys Tyr Val Asp Glu Glu Lys
80 85 90

GGC TTG ATT TTA TGG GAC ACT AAA GGC ATT GAA GAT AAA GAT TAT GAA 400
Gly Leu Ile Leu Trp Asp Thr Lys Gly Ile Glu Asp Lys Asp Tyr Glu
95 100 105

AAT ACC TTG GAA AGC ATT AAA AAA GAA ATG GAA GAT TCT TTT AAA ACG 448
Asn Thr Leu Glu Ser Ile Lys Lys Glu Met Glu Asp Ser Phe Lys Thr
110 115 120

CTT GAT GAA AAA GAG GCT ATT GAT GTG GCG TAT CTG TGC GTT AAA GAG 496
Leu Asp Glu Lys Glu Ala Ile Asp Val Ala Tyr Leu Cys Val Lys Glu
125 130 135 140

ACT TCT GGT AGG GTT CAA GAG AGA GAG AGA GAG AGT TAT TAAGCTTTAC TA 547
Thr Ser Gly Arg Val Gln Glu Arg Glu Arg Glu Ser Tyr
145 150

AAAAATGGAA TATCCCAACG ATTTTCGTTT TCACCAACAC ACAAGAAAAA GCCGGCGATG 607

CCTTTGTTAA AAAAACT 624

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 153 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Met Glu Asn Asn Glu Asn His Glu Lys Leu Asn Gly Val Leu Arg Lys
1 5 10 15

Phe Leu Gly Asp Ala Phe Thr Leu Asp Gly Lys Glu Gly Gly Leu Asn
20 25 30

Met Glu Lys Leu Arg Glu Ala Ile Lys Lys Glu Lys Pro Ile Met Asn
35 40 45

Ile Leu Leu Met Gly Ala Thr Gly Val Gly Lys Ser Ser Leu Ile Asn
50 55 60

Ala Leu Phe Gly Lys Glu Val Ala Lys Ala Gly Val Gly Lys Pro Ile
65 70 75 80

Thr Gln His Leu Glu Lys Tyr Val Asp Glu Glu Lys Gly Leu Ile Leu
85 90 95

Trp Asp Thr Lys Gly Ile Glu Asp Lys Asp Tyr Glu Asn Thr Leu Glu
100 105 110

Ser Ile Lys Lys Glu Met Glu Asp Ser Phe Lys Thr Leu Asp Glu Lys
115 120 125

Glu Ala Ile Asp Val Ala Tyr Leu Cys Val Lys Glu Thr Ser Gly Arg
130 135 140

Val Gln Glu Arg Glu Arg Glu Ser Tyr
145 150

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1083 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 155...1033
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GATGTTGTTA AGTCGTTGTT TATTATGTTA CACTAAAAGC TTAAATAAAA GGGCATAAGG 60

GATAAAGGGA GTGTTAGTAG ATAGTTTTAA TAGGGTTATT GACTATATTA GGGTTTCTGT 120

AACCAAACAG TGCAATTTCA GGTGTCAGTA TTGC ATG CCT GCT ACG CCA TTA AAT 175
Met Pro Ala Thr Pro Leu Asn
1 5

TTT TTT GAT AAT GAA GAA TTA TTG CCT TTG GAT AAT GTT TTA GAA TTT 223
Phe Phe Asp Asn Glu Glu Leu Leu Pro Leu Asp Asn Val Leu Glu Phe
10 15 20

CTC AAA ATC GCC ATT GAT GAG GGC GTT AAA AAA ATT AGA ATC ACG GGT 271
Leu Lys Ile Ala Ile Asp Glu Gly Val Lys Lys Ile Arg Ile Thr Gly
25 30 35

GGG GAG CCG CTA TTA CGC AAA GGC TTA GAT GAA TTT ATC GCT AAA TTG 319
Gly Glu Pro Leu Leu Arg Lys Gly Leu Asp Glu Phe Ile Ala Lys Leu
40 45 50 55

CAC GCT TAC AAT AAA GAA GTG GAG TTA GTT TTA AGC ACT AAT GGT TTT 367
His Ala Tyr Asn Lys Glu Val Glu Leu Val Leu Ser Thr Asn Gly Phe
60 65 70

TTA CTC AAA AAA ATG GCT AAG GAT TTA AAA AAT GCC GGG TTA GCG CAA 415
Leu Leu Lys Lys Met Ala Lys Asp Leu Lys Asn Ala Gly Leu Ala Gln
75 80 85

GTG AAT GTT TCA TTG GAT TCT TTA AAA AGC GAT AGG GTT TTA AAA ATC 463
Val Asn Val Ser Leu Asp Ser Leu Lys Ser Asp Arg Val Leu Lys Ile
90 95 100

TCT CAA AAA GAC GCT CTT AAA AAC ACG CTA GAA GGG ATT GAA GAG TCT 511
Ser Gln Lys Asp Ala Leu Lys Asn Thr Leu Glu Gly Ile Glu Glu Ser
105 110 115

TTG AAA GTG GGT TTA AAA CTC AAA TTA AAC ACG GTT GTG ATA AAA AGC 559
Leu Lys Val Gly Leu Lys Leu Lys Leu Asn Thr Val Val Ile Lys Ser
120 125 130 135

GTT AAT GAT GAT GAA ATC TTA GAG CTT TTA GAA TAC GCA AAA AAT AGG 607
Val Asn Asp Asp Glu Ile Leu Glu Leu Leu Glu Tyr Ala Lys Asn Arg
140 145 150

CAT ATA CAA ATC CGC TAC ATT GAA TTT ATG GAA AAC ACG CAT GCT AAA 655
His Ile Gln Ile Arg Tyr Ile Glu Phe Met Glu Asn Thr His Ala Lys
155 160 165

AGT TTG GTT AAA GGC TTG AAA GAG CGA GAA ATT TTA GAT TTG ATC GCT 703
Ser Leu Val Lys Gly Leu Lys Glu Arg Glu Ile Leu Asp Leu Ile Ala
170 175 180

CAA AAA TAT CAA ATC ATT GAG GCA GAA AAA CCC AAA CAA GGG TCT TCT 751
Gln Lys Tyr Gln Ile Ile Glu Ala Glu Lys Pro Lys Gln Gly Ser Ser
185 190 195

AAA ATC TAC ACG CTA GAA AAT GGC TAT CAA TTT GGC ATT ATC GCT CCG 799
Lys Ile Tyr Thr Leu Glu Asn Gly Tyr Gln Phe Gly Ile Ile Ala Pro
200 205 210 215

CAT AGC GAT GAT TTT TGC CAA TCT TGC AAT CGT ATC CGT TTG GCT TCT 847
His Ser Asp Asp Phe Cys Gln Ser Cys Asn Arg Ile Arg Leu Ala Ser
220 225 230

GAT GGT AAG ATT TGC CCA TGT TTA TAC TAT CAA GAC GCC ATA GAC GCT 895
Asp Gly Lys Ile Cys Pro Cys Leu Tyr Tyr Gln Asp Ala Ile Asp Ala
235 240 245

AAA GAG GCG ATC ATC AAT AAG GAT ACA AAA AAT ATA AAA AGG CTT TTA 943
Lys Glu Ala Ile Ile Asn Lys Asp Thr Lys Asn Ile Lys Arg Leu Leu
250 255 260

AAG CAA TCT GTC ATC AAT AAA CCA GAA AAA AAC ATG TGG AAT GAT AAA 991
Lys Gln Ser Val Ile Asn Lys Pro Glu Lys Asn Met Trp Asn Asp Lys
265 270 275

AAC AGC GAA ACT CCC ACA AGG GCG TTT TAC TAC ACA GGG GGG TAGGGGAGT 1042
Asn Ser Glu Thr Pro Thr Arg Ala Phe Tyr Tyr Thr Gly Gly
280 285 290

AAAATATTTA TTATTTTAAA CCTTTTTATT AAAAATAAGG C 1083

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 293 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Met Pro Ala Thr Pro Leu Asn Phe Phe Asp Asn Glu Glu Leu Leu Pro
1 5 10 15

Leu Asp Asn Val Leu Glu Phe Leu Lys Ile Ala Ile Asp Glu Gly Val
20 25 30

Lys Lys Ile Arg Ile Thr Gly Gly Glu Pro Leu Leu Arg Lys Gly Leu
35 40 45

Asp Glu Phe Ile Ala Lys Leu His Ala Tyr Asn Lys Glu Val Glu Leu
50 55 60

Val Leu Ser Thr Asn Gly Phe Leu Leu Lys Lys Met Ala Lys Asp Leu
65 70 75 80

Lys Asn Ala Gly Leu Ala Gln Val Asn Val Ser Leu Asp Ser Leu Lys
85 90 95

Ser Asp Arg Val Leu Lys Ile Ser Gln Lys Asp Ala Leu Lys Asn Thr
100 105 110

Leu Glu Gly Ile Glu Glu Ser Leu Lys Val Gly Leu Lys Leu Lys Leu
115 120 125

Asn Thr Val Val Ile Lys Ser Val Asn Asp Asp Glu Ile Leu Glu Leu
130 135 140

Leu Glu Tyr Ala Lys Asn Arg His Ile Gln Ile Arg Tyr Ile Glu Phe
145 150 155 160

Met Glu Asn Thr His Ala Lys Ser Leu Val Lys Gly Leu Lys Glu Arg
165 170 175

Glu Ile Leu Asp Leu Ile Ala Gln Lys Tyr Gln Ile Ile Glu Ala Glu
180 185 190

Lys Pro Lys Gln Gly Ser Ser Lys Ile Tyr Thr Leu Glu Asn Gly Tyr
195 200 205

Gln Phe Gly Ile Ile Ala Pro His Ser Asp Asp Phe Cys Gln Ser Cys
210 215 220

Asn Arg Ile Arg Leu Ala Ser Asp Gly Lys Ile Cys Pro Cys Leu Tyr
225 230 235 240

Tyr Gln Asp Ala Ile Asp Ala Lys Glu Ala Ile Ile Asn Lys Asp Thr
245 250 255

Lys Asn Ile Lys Arg Leu Leu Lys Gln Ser Val Ile Asn Lys Pro Glu
260 265 270

Lys Asn Met Trp Asn Asp Lys Asn Ser Glu Thr Pro Thr Arg Ala Phe
275 280 285

Tyr Tyr Thr Gly Gly
290

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1181 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 121...1137
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

ACTTCTCAAT CAGCGAGCTA TCATGCAAGG CCTTATGTGG TGGATACCGC TTTTTTACGA 60

TACGATTACA AAGATGTTTT TGGGTTTAAG GCGGGGCGCT ATGAAGCGAA TATTGATTTC 120

ATG AGC GGA TCG AAT CAA GGG TGG GAA GTG TAT TAT CAG CCC TAT AAG 168
Met Ser Gly Ser Asn Gln Gly Trp Glu Val Tyr Tyr Gln Pro Tyr Lys
1 5 10 15

ACT GAA ACG CAA AGG TTA AGG TTT TGG TGG TGG AGT TCT TTT GGG AGA 216
Thr Glu Thr Gln Arg Leu Arg Phe Trp Trp Trp Ser Ser Phe Gly Arg
20 25 30

GGT TTA GCG TTC AAC TCT TGG ATT TAT GAG TTT TTT GCG ACG GTG CCT 264
Gly Leu Ala Phe Asn Ser Trp Ile Tyr Glu Phe Phe Ala Thr Val Pro
35 40 45

TAT TTG AAA AAG GGA GGC AAT CCT AAT AAC AGC AAC GAT TTC ATC AAT 312
Tyr Leu Lys Lys Gly Gly Asn Pro Asn Asn Ser Asn Asp Phe Ile Asn
50 55 60

TAT GGC TGG CAT GGA ATC ACC ACA ACC TAT TCT TAT AAA GGT TTA GAC 360
Tyr Gly Trp His Gly Ile Thr Thr Thr Tyr Ser Tyr Lys Gly Leu Asp
65 70 75 80

GCT CAA TTT TTT TAT TAT TTT GCG CCT AAG ACT TAT AAC GCT CCT GGC 408
Ala Gln Phe Phe Tyr Tyr Phe Ala Pro Lys Thr Tyr Asn Ala Pro Gly
85 90 95

TTT AAG CTG GTC TAT GAC ACG AAT AGG AAT TTT CAA AAT GTA GGC TTT 456
Phe Lys Leu Val Tyr Asp Thr Asn Arg Asn Phe Gln Asn Val Gly Phe
100 105 110

CGC TCT CAA AGC ATG ATC ATG ACA ACC TTT CCT TTA TAC TAT AGA GGG 504
Arg Ser Gln Ser Met Ile Met Thr Thr Phe Pro Leu Tyr Tyr Arg Gly
115 120 125

TGG TAT AAC CCA GAG ACA AAC ACT TAT AGT TTA GAA GAC AGC ACG CCT 552
Trp Tyr Asn Pro Glu Thr Asn Thr Tyr Ser Leu Glu Asp Ser Thr Pro
130 135 140

CAT GGC TCG TTG TTG GGG AGG AAT GGC GTT ACT TTA AAT ATC CGC CAG 600
His Gly Ser Leu Leu Gly Arg Asn Gly Val Thr Leu Asn Ile Arg Gln
145 150 155 160

GTT TTT TGG TGG GAT AAT TTC AAC TGG TCC ATT GGC TTT TAT AAC ACC 648
Val Phe Trp Trp Asp Asn Phe Asn Trp Ser Ile Gly Phe Tyr Asn Thr
165 170 175

TTT GGC AAT TCG GAC GCT TTT TTA GGC TCT CAC ACG ATG CCA AGG GGT 696
Phe Gly Asn Ser Asp Ala Phe Leu Gly Ser His Thr Met Pro Arg Gly
180 185 190

AAT AAC ACT TCC TAT ATC GGT AGT GAA ATC TCC ATA ACG ACT AGG CAT 744
Asn Asn Thr Ser Tyr Ile Gly Ser Glu Ile Ser Ile Thr Thr Arg His
195 200 205

GCC GGA ATG ATT GGC TAT GAT TTT TGG GAT AAT ACG GCT TAT GAT GGG 792
Ala Gly Met Ile Gly Tyr Asp Phe Trp Asp Asn Thr Ala Tyr Asp Gly
210 215 220

CTA GCT GAT GCG ATC ACT AAC GCT AAC ACT TTC ACT TTT TAC ACT TCT 840
Leu Ala Asp Ala Ile Thr Asn Ala Asn Thr Phe Thr Phe Tyr Thr Ser
225 230 235 240

GTT GGA GGG ATC CAT AAG CGT TTT GCA TGG CAT GTT TTT GGG CGC GTC 888
Val Gly Gly Ile His Lys Arg Phe Ala Trp His Val Phe Gly Arg Val
245 250 255

TCT CAT GCG AAT AAA AAC GCG TTA GGG CAA GTG GGG AGG GCT AAT GAA 936
Ser His Ala Asn Lys Asn Ala Leu Gly Gln Val Gly Arg Ala Asn Glu
260 265 270

TAT TCC TTG CAA TTC AAC GCG AGC TAT GCG TTC ACT GAA TCA ATC CTT 984
Tyr Ser Leu Gln Phe Asn Ala Ser Tyr Ala Phe Thr Glu Ser Ile Leu
275 280 285

CTT AAC TTT AGG ATC ACT TAT TAT GGG GCT AGG ATC AAT AAA GGG TAT 1032
Leu Asn Phe Arg Ile Thr Tyr Tyr Gly Ala Arg Ile Asn Lys Gly Tyr
290 295 300

CAA GCG GGG TAT TTT GGA GCG CCC AAA TTC AAT AAC CCT GAT GGC GAT 1080
Gln Ala Gly Tyr Phe Gly Ala Pro Lys Phe Asn Asn Pro Asp Gly Asp
305 310 315 320

TTT AGC GCT AAT TAC CAA GAC AGA AGT TAC ATG ATG ACC AAC CTC ACG 1128
Phe Ser Ala Asn Tyr Gln Asp Arg Ser Tyr Met Met Thr Asn Leu Thr
325 330 335

CTG AAG TTT TGATTTCCAA TCACAGCGAG TTAAAAACAC TCCAAGGCAT TTTT 1181
Leu Lys Phe



(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 339 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Ser Gly Ser Asn Gln Gly Trp Glu Val Tyr Tyr Gln Pro Tyr Lys
1 5 10 15

Thr Glu Thr Gln Arg Leu Arg Phe Trp Trp Trp Ser Ser Phe Gly Arg
20 25 30

Gly Leu Ala Phe Asn Ser Trp Ile Tyr Glu Phe Phe Ala Thr Val Pro
35 40 45

Tyr Leu Lys Lys Gly Gly Asn Pro Asn Asn Ser Asn Asp Phe Ile Asn
50 55 60

Tyr Gly Trp His Gly Ile Thr Thr Thr Tyr Ser Tyr Lys Gly Leu Asp
65 70 75 80

Ala Gln Phe Phe Tyr Tyr Phe Ala Pro Lys Thr Tyr Asn Ala Pro Gly
85 90 95

Phe Lys Leu Val Tyr Asp Thr Asn Arg Asn Phe Gln Asn Val Gly Phe
100 105 110




Arg Ser Gln Ser Met Ile Met Thr Thr Phe Pro Leu Tyr Tyr Arg Gly
115 120 125

Trp Tyr Asn Pro Glu Thr Asn Thr Tyr Ser Leu Glu Asp Ser Thr Pro
130 135 140

His Gly Ser Leu Leu Gly Arg Asn Gly Val Thr Leu Asn Ile Arg Gln
145 150 155 160

Val Phe Trp Trp Asp Asn Phe Asn Trp Ser Ile Gly Phe Tyr Asn Thr
165 170 175

Phe Gly Asn Ser Asp Ala Phe Leu Gly Ser His Thr Met Pro Arg Gly
180 185 190

Asn Asn Thr Ser Tyr Ile Gly Ser Glu Ile Ser Ile Thr Thr Arg His
195 200 205

Ala Gly Met Ile Gly Tyr Asp Phe Trp Asp Asn Thr Ala Tyr Asp Gly
210 215 220

Leu Ala Asp Ala Ile Thr Asn Ala Asn Thr Phe Thr Phe Tyr Thr Ser
225 230 235 240

Val Gly Gly Ile His Lys Arg Phe Ala Trp His Val Phe Gly Arg Val
245 250 255

Ser His Ala Asn Lys Asn Ala Leu Gly Gln Val Gly Arg Ala Asn Glu
260 265 270

Tyr Ser Leu Gln Phe Asn Ala Ser Tyr Ala Phe Thr Glu Ser Ile Leu
275 280 285

Leu Asn Phe Arg Ile Thr Tyr Tyr Gly Ala Arg Ile Asn Lys Gly Tyr
290 295 300

Gln Ala Gly Tyr Phe Gly Ala Pro Lys Phe Asn Asn Pro Asp Gly Asp
305 310 315 320

Phe Ser Ala Asn Tyr Gln Asp Arg Ser Tyr Met Met Thr Asn Leu Thr
325 330 335

Leu Lys Phe



(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 959 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 133...879
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

TAAGGAAATG AGTTTTTATA TCATAAAATA AAGTAACCGA GAAAAATCTT TCTCTAAAAA 60

TAATACTTTT TTAGTTATAA TAACAATTTT GTTTTTTCAA AAACAATAAT TACTATATTT 120

AGGATTTTAA GA ATG AAT GAC AAG CGT TTT AGA AAA TAT TGT AGT TTT TCT 171
Met Asn Asp Lys Arg Phe Arg Lys Tyr Cys Ser Phe Ser
1 5 10

ATT TTT TTG TCC TTA TTA GGA ACG TTT GAA TTA GAG GCT AAA GAA GAA 219
Ile Phe Leu Ser Leu Leu Gly Thr Phe Glu Leu Glu Ala Lys Glu Glu
15 20 25

GAA GAA AAA GAA GAA AGA AAG ACA GAA AGG AAA AAA GAA AAG AAC GCC 267
Glu Glu Lys Glu Glu Arg Lys Thr Glu Arg Lys Lys Glu Lys Asn Ala
30 35 40 45

CAA CAC ACT CTA GGC AAG GTT ACC ACT CAA GCG GCT AAA ATC TTT AAC 315
Gln His Thr Leu Gly Lys Val Thr Thr Gln Ala Ala Lys Ile Phe Asn
50 55 60

TAC AAC AAC CAG ACA ACC ATT TCA AGT AAG GAA TTA GAA AGA AGG CAA 363
Tyr Asn Asn Gln Thr Thr Ile Ser Ser Lys Glu Leu Glu Arg Arg Gln
65 70 75

GCC AAC CAA ATC AGC GAC ATG TTT AGA AGA AAC CCT AAT ATC AAT GTG 411
Ala Asn Gln Ile Ser Asp Met Phe Arg Arg Asn Pro Asn Ile Asn Val
80 85 90

GGC GGT GGT GCG GTG ATA GCG CAA AAA ATT TAT GTG CGC GGT ATT GAA 459
Gly Gly Gly Ala Val Ile Ala Gln Lys Ile Tyr Val Arg Gly Ile Glu
95 100 105

GAC AGA TTG GCT CGG GTT ACG GTG GAT GGG GCG GCG CAA ATG GGT GCA 507
Asp Arg Leu Ala Arg Val Thr Val Asp Gly Ala Ala Gln Met Gly Ala
110 115 120 125

AGC TAT GGG CAT CAA GGC AAT ACG ATC ATT GAC CCT GGA ATG CTT AAA 555
Ser Tyr Gly His Gln Gly Asn Thr Ile Ile Asp Pro Gly Met Leu Lys
130 135 140

AGC GTG GTG GTT ACT AAA GGG GCG GCT CAA GCG AGC GCG GGG CCT ATG 603
Ser Val Val Val Thr Lys Gly Ala Ala Gln Ala Ser Ala Gly Pro Met
145 150 155

GCT TTG ATT GGC GCG ATT AAA ATG GAG ACT AAA AGT GCT AGC GAT TTT 651
Ala Leu Ile Gly Ala Ile Lys Met Glu Thr Lys Ser Ala Ser Asp Phe
160 165 170

ATC CCT AAA GGT AAA GAC TAC GCC ATA AGT GGG GCT GCC ACT TTT TTA 699
Ile Pro Lys Gly Lys Asp Tyr Ala Ile Ser Gly Ala Ala Thr Phe Leu
175 180 185

ACC AAC TTT GGG GAT CGA GAA ACC GTG ATG GGC GCT TAT CGT CAT AAT 747
Thr Asn Phe Gly Asp Arg Glu Thr Val Met Gly Ala Tyr Arg His Asn
190 195 200 205

CAT TTT GAT GCG CTT TTG TAT TAC ACG CAT CAA AAT ATT TTT TAC TAT 795
His Phe Asp Ala Leu Leu Tyr Tyr Thr His Gln Asn Ile Phe Tyr Tyr
210 215 220

CGT GAT GGG GAT AAT GCT ACA AAA GAT CTC TTT AGA CCT AAA GCG GAG 843
Arg Asp Gly Asp Asn Ala Thr Lys Asp Leu Phe Arg Pro Lys Ala Glu
225 230 235

AAT AAA GTT ACA GAA GTC CTA GCG AGC AAA ACA ATG TGATGGCTAA GATCAA 895
Asn Lys Val Thr Glu Val Leu Ala Ser Lys Thr Met
240 245

TGGTTATTTG AGCGAAAGGG ATATTTTAAC GCTCAGTTAT AACATGACCA GAGACAACGC 955

TAAC 959

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Met Asn Asp Lys Arg Phe Arg Lys Tyr Cys Ser Phe Ser Ile Phe Leu
1 5 10 15

Ser Leu Leu Gly Thr Phe Glu Leu Glu Ala Lys Glu Glu Glu Glu Lys
20 25 30

Glu Glu Arg Lys Thr Glu Arg Lys Lys Glu Lys Asn Ala Gln His Thr
35 40 45

Leu Gly Lys Val Thr Thr Gln Ala Ala Lys Ile Phe Asn Tyr Asn Asn
50 55 60

Gln Thr Thr Ile Ser Ser Lys Glu Leu Glu Arg Arg Gln Ala Asn Gln
65 70 75 80

Ile Ser Asp Met Phe Arg Arg Asn Pro Asn Ile Asn Val Gly Gly Gly
85 90 95

Ala Val Ile Ala Gln Lys Ile Tyr Val Arg Gly Ile Glu Asp Arg Leu
100 105 110

Ala Arg Val Thr Val Asp Gly Ala Ala Gln Met Gly Ala Ser Tyr Gly
115 120 125

His Gln Gly Asn Thr Ile Ile Asp Pro Gly Met Leu Lys Ser Val Val
130 135 140

Val Thr Lys Gly Ala Ala Gln Ala Ser Ala Gly Pro Met Ala Leu Ile
145 150 155 160

Gly Ala Ile Lys Met Glu Thr Lys Ser Ala Ser Asp Phe Ile Pro Lys
165 170 175

Gly Lys Asp Tyr Ala Ile Ser Gly Ala Ala Thr Phe Leu Thr Asn Phe
180 185 190

Gly Asp Arg Glu Thr Val Met Gly Ala Tyr Arg His Asn His Phe Asp
195 200 205

Ala Leu Leu Tyr Tyr Thr His Gln Asn Ile Phe Tyr Tyr Arg Asp Gly
210 215 220

Asp Asn Ala Thr Lys Asp Leu Phe Arg Pro Lys Ala Glu Asn Lys Val
225 230 235 240

Thr Glu Val Leu Ala Ser Lys Thr Met
245



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1306 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 40...1266
(D) OTHER INFORMATION:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 40...219
(D) OTHER INFORMATION:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 220...1266
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

TTTGACAGCT TATCATTTGG CAATAAAACA CCAAAATGA ATG AGT TAC ACA AAA 54
Met Ser Tyr Thr Lys
-60

AAA TAC TCA ACA CCA CCC AAC CGG CGT AAA ATG CAA AAC ATT ATC GCT 102
Lys Tyr Ser Thr Pro Pro Asn Arg Arg Lys Met Gln Asn Ile Ile Ala
-55 -50 -45 -40

ATT AAA AGA TCC TCT AGA GTC GAC CTG CAG GCA TGC AAG CTA GCT TTC 150
Ile Lys Arg Ser Ser Arg Val Asp Leu Gln Ala Cys Lys Leu Ala Phe
-35 -30 -25

GCG AGC TCG AGA TCA CCC ATG CAA TTT CAA AAA ACC TTA TTT CCT TTA 198
Ala Ser Ser Arg Ser Pro Met Gln Phe Gln Lys Thr Leu Phe Pro Leu
-20 -15 -10

CCC TTA TTA TTT TTA TCT TGT TGT ATC GCT GAA GAA AAT GGG GCG TAT 246
Pro Leu Leu Phe Leu Ser Cys Cys Ile Ala Glu Glu Asn Gly Ala Tyr
-5 1 5

GCG AGC GTG GGG TTT GAA TAT TCC ATT AGT CAT GCC GTT GAG CAT AAT 294
Ala Ser Val Gly Phe Glu Tyr Ser Ile Ser His Ala Val Glu His Asn
10 15 20 25

AAC CCT TTT TTA AAT CAA GAA CGC ATC CAA ATC ATT TCT AAC GCT CAA 342
Asn Pro Phe Leu Asn Gln Glu Arg Ile Gln Ile Ile Ser Asn Ala Gln
30 35 40

AAC AAA ATC TAT AAA CTC AAT CAA GTC AAA AAT GAA ATC ACA AGC ATG 390
Asn Lys Ile Tyr Lys Leu Asn Gln Val Lys Asn Glu Ile Thr Ser Met
45 50 55

CAA AAC ACC TTT AAT TAC ATC AAC AAC GCT TTA AAA AAC AAT GCT AAA 438
Gln Asn Thr Phe Asn Tyr Ile Asn Asn Ala Leu Lys Asn Asn Ala Lys
60 65 70

TTA ACC CCC ACT GAA ATC CAA GCT GAG AAA TAC TAC CTC CAA TCC ACC 486
Leu Thr Pro Thr Glu Ile Gln Ala Glu Lys Tyr Tyr Leu Gln Ser Thr
75 80 85

CTT CAA AAC ATT GAA AAA ATA GTC ACA CTT AGC GGT GGC GTT GCA TCT 534
Leu Gln Asn Ile Glu Lys Ile Val Thr Leu Ser Gly Gly Val Ala Ser
90 95 100 105

AAC CCC AAA CTA GTC CAA GCG TTG GAA AAA ATG CAA GAA CCC ATT ACT 582
Asn Pro Lys Leu Val Gln Ala Leu Glu Lys Met Gln Glu Pro Ile Thr
110 115 120

AAC CCT TTA GAA TTA GCA GAA AAC TTA AGA AAT TTA GAA TTG CAA TTT 630
Asn Pro Leu Glu Leu Ala Glu Asn Leu Arg Asn Leu Glu Leu Gln Phe
125 130 135

GCT CAA TCT CAA AAC CGC ATG CTT TCT TCT TTA TCT TCT CAA ACC GCT 678
Ala Gln Ser Gln Asn Arg Met Leu Ser Ser Leu Ser Ser Gln Thr Ala
140 145 150

CAA ATT TCA AAT TCT TTG AAC GCG CTT GAT CCC AGC TCT TAT TCT AAA 726
Gln Ile Ser Asn Ser Leu Asn Ala Leu Asp Pro Ser Ser Tyr Ser Lys
155 160 165

AAC ATT TCA AGC ATG TCT GGG GTG AGT TTG AGC GTA GGG TAT AAG CAT 774
Asn Ile Ser Ser Met Ser Gly Val Ser Leu Ser Val Gly Tyr Lys His
170 175 180 185

TTC TTT ACT AAG AAA AAA AAT CAA GGG TTT CGC TAT TAC TTG TTT TAT 822
Phe Phe Thr Lys Lys Lys Asn Gln Gly Phe Arg Tyr Tyr Leu Phe Tyr
190 195 200

GAC TAT GGT TAC ACT AAC TTT GGT TTT GTG GGT AAT GGC TTT GAT GGT 870
Asp Tyr Gly Tyr Thr Asn Phe Gly Phe Val Gly Asn Gly Phe Asp Gly
205 210 215

TTA GGC AAA ATG AAT AAC CAC CTC TAT GGG CTT GGA ATA AAC TAC CTT 918
Leu Gly Lys Met Asn Asn His Leu Tyr Gly Leu Gly Ile Asn Tyr Leu
220 225 230

TAT AAT TTC ATT GAT AAT GCA CAA AAA CAT TCG AGC GTG GGT TTT TAT 966
Tyr Asn Phe Ile Asp Asn Ala Gln Lys His Ser Ser Val Gly Phe Tyr
235 240 245

GCG GGT TTT GCT TTG GCG GGG AAT TCG TGG GTA GGG AAT GGT TTA GGC 1014
Ala Gly Phe Ala Leu Ala Gly Asn Ser Trp Val Gly Asn Gly Leu Gly
250 255 260 265

ATG TGG GTG AGC CAA ACG GAT TTT ATC AAC AAT TAC TTG ATG GGC TAT 1062
Met Trp Val Ser Gln Thr Asp Phe Ile Asn Asn Tyr Leu Met Gly Tyr
270 275 280

CAA GCT AAA ATA CAC ACG AAC TTT TTC CAG ATC CCT TTG AAT TTT GGG 1110
Gln Ala Lys Ile His Thr Asn Phe Phe Gln Ile Pro Leu Asn Phe Gly
285 290 295

GTT CGT GTG AAT GTC AAT AGG CAT AAC GGA TTT GAA ATG GGC CTA AAA 1158
Val Arg Val Asn Val Asn Arg His Asn Gly Phe Glu Met Gly Leu Lys
300 305 310

ATC CCT TTA GCG GTG AAT TCC TTT TAT GAA ACG CAT GGC AAA GGG TTA 1206
Ile Pro Leu Ala Val Asn Ser Phe Tyr Glu Thr His Gly Lys Gly Leu
315 320 325

AAC ACT TCC CTC TTT TTC AAA CGC CTT GTG GTG TTT AAT GTG AGT TAT 1254
Asn Thr Ser Leu Phe Phe Lys Arg Leu Val Val Phe Asn Val Ser Tyr
330 335 340 345

GTT TAT AGT TTT TAGGGGGTAA ATGCCTTCAA ACGCTCTTTT GATTGAAGAA 1306
Val Tyr Ser Phe



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 409 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Met Ser Tyr Thr Lys Lys Tyr Ser Thr Pro Pro Asn Arg Arg Lys Met
-60 -55 -50 -45

Gln Asn Ile Ile Ala Ile Lys Arg Ser Ser Arg Val Asp Leu Gln Ala
-40 -35 -30

Cys Lys Leu Ala Phe Ala Ser Ser Arg Ser Pro Met Gln Phe Gln Lys
-25 -20 -15

Thr Leu Phe Pro Leu Pro Leu Leu Phe Leu Ser Cys Cys Ile Ala Glu
-10 -5 1

Glu Asn Gly Ala Tyr Ala Ser Val Gly Phe Glu Tyr Ser Ile Ser His
5 10 15 20

Ala Val Glu His Asn Asn Pro Phe Leu Asn Gln Glu Arg Ile Gln Ile
25 30 35

Ile Ser Asn Ala Gln Asn Lys Ile Tyr Lys Leu Asn Gln Val Lys Asn
40 45 50

Glu Ile Thr Ser Met Gln Asn Thr Phe Asn Tyr Ile Asn Asn Ala Leu
55 60 65

Lys Asn Asn Ala Lys Leu Thr Pro Thr Glu Ile Gln Ala Glu Lys Tyr
70 75 80

Tyr Leu Gln Ser Thr Leu Gln Asn Ile Glu Lys Ile Val Thr Leu Ser
85 90 95 100

Gly Gly Val Ala Ser Asn Pro Lys Leu Val Gln Ala Leu Glu Lys Met
105 110 115

Gln Glu Pro Ile Thr Asn Pro Leu Glu Leu Ala Glu Asn Leu Arg Asn
120 125 130

Leu Glu Leu Gln Phe Ala Gln Ser Gln Asn Arg Met Leu Ser Ser Leu
135 140 145

Ser Ser Gln Thr Ala Gln Ile Ser Asn Ser Leu Asn Ala Leu Asp Pro
150 155 160

Ser Ser Tyr Ser Lys Asn Ile Ser Ser Met Ser Gly Val Ser Leu Ser
165 170 175 180

Val Gly Tyr Lys His Phe Phe Thr Lys Lys Lys Asn Gln Gly Phe Arg
185 190 195

Tyr Tyr Leu Phe Tyr Asp Tyr Gly Tyr Thr Asn Phe Gly Phe Val Gly
200 205 210

Asn Gly Phe Asp Gly Leu Gly Lys Met Asn Asn His Leu Tyr Gly Leu
215 220 225

Gly Ile Asn Tyr Leu Tyr Asn Phe Ile Asp Asn Ala Gln Lys His Ser
230 235 240

Ser Val Gly Phe Tyr Ala Gly Phe Ala Leu Ala Gly Asn Ser Trp Val
245 250 255 260

Gly Asn Gly Leu Gly Met Trp Val Ser Gln Thr Asp Phe Ile Asn Asn
265 270 275

Tyr Leu Met Gly Tyr Gln Ala Lys Ile His Thr Asn Phe Phe Gln Ile
280 285 290

Pro Leu Asn Phe Gly Val Arg Val Asn Val Asn Arg His Asn Gly Phe
295 300 305

Glu Met Gly Leu Lys Ile Pro Leu Ala Val Asn Ser Phe Tyr Glu Thr
310 315 320

His Gly Lys Gly Leu Asn Thr Ser Leu Phe Phe Lys Arg Leu Val Val
325 330 335 340

Phe Asn Val Ser Tyr Val Tyr Ser Phe
345



(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1030 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 342...824
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

CACTCTAAGC GTCAAACTCT CTTTTTCTTT AGAGGAAGAA AGCAAGCGGA TCCATCTTAA 60
AGCCTTACAA AATATCTTAA ATAACGCTAA AAGCGCGCAT TTTAAATTTG TTTTAGAGAG 120
CCAAAACGCC GCTCAATCTA TTATAGAAAT TCAAAGCCTC TTGAAACAAC TCTCCTTAAA 180
AAATAATGAA ATCTTTTTAA TGCCTTTAGG CACAAATAAC AACGAGCTAG ACAAAAATCT 240
AAAAACCCTA GCCCCCCTAG CCATAAAGCA TGGTTTCAGG CTAAGCGATA GGCTTCATAT 300

CCGCTTGTGG GATAATCAAA AAGGGTTTTA AAAAGTTAAT C ATG ACC ATC AAA GTT 356
Met Thr Ile Lys Val
1 5

TTT TCG CCC AAA TAC CCC ACT GAA TTA GAA GAA TTT TAT GCT GAG CGT 404
Phe Ser Pro Lys Tyr Pro Thr Glu Leu Glu Glu Phe Tyr Ala Glu Arg
10 15 20

ATC GCT GAC AAC CCT TTA GGG TTT ATC CAA CGC TTG GAT CTT TTG CCT 452
Ile Ala Asp Asn Pro Leu Gly Phe Ile Gln Arg Leu Asp Leu Leu Pro
25 30 35

AGT ATT AGC GGG TTC GTT CAA AAA TTG CGC GAG CAT GGC GGG GAA TTT 500
Ser Ile Ser Gly Phe Val Gln Lys Leu Arg Glu His Gly Gly Glu Phe
40 45 50

TTT GAA ATG AGA GAG GGT AAC AAG CTC ATT GGG ATT TGT GGG CTT AAT 548
Phe Glu Met Arg Glu Gly Asn Lys Leu Ile Gly Ile Cys Gly Leu Asn
55 60 65

CCT ATC AAT CAA ACA GAA GCC GAG CTG TGC AAA TTC CAC ATA AAT AGT 596
Pro Ile Asn Gln Thr Glu Ala Glu Leu Cys Lys Phe His Ile Asn Ser
70 75 80 85

GCT TAT CAA TCC CAA GGG CTA GGT CAA AAA CTC TAT GAG AGC GTG GAG 644
Ala Tyr Gln Ser Gln Gly Leu Gly Gln Lys Leu Tyr Glu Ser Val Glu
90 95 100

AAA TAC GCT TTC ATT AAA GGC TAT ACT AAA ATC TCT CTG CAT GTG AGC 692
Lys Tyr Ala Phe Ile Lys Gly Tyr Thr Lys Ile Ser Leu His Val Ser
105 110 115

AAA AGC CAA ATC AAG GCA TGC AAC CTC TAT CAA AAG CTG GGT TTT GTG 740
Lys Ser Gln Ile Lys Ala Cys Asn Leu Tyr Gln Lys Leu Gly Phe Val
120 125 130

CAC ATC AAA GAA GAG GAT TGC GTG GTG GAG TTG GGC GAA GAG ACT TTG 788
His Ile Lys Glu Glu Asp Cys Val Val Glu Leu Gly Glu Glu Thr Leu
135 140 145

ATT TTC CCC ACT CTT TTT ATG GAA AAG ATT TTG TCT TGATTGGTGC ATCCAT 840
Ile Phe Pro Thr Leu Phe Met Glu Lys Ile Leu Ser
150 155 160

TTGACACACG CCCAAGCGAC ATTCAAACTA TCAAACTTTC ATTAACACAA CCCAATTAAC 900
GCTAAATAAA CCCTAAAACA AACACTCGTT GTTAAAATTT TGTTTTTCAA GCGCTTCGCA 960
AAGTTTTAGA AGCCCTATTT AGGGGTTAAC GCTAAAATAG GCTATCAAAA CTACTTTAAT 1020
GATTTTATAG 1030



(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 161 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Met Thr Ile Lys Val Phe Ser Pro Lys Tyr Pro Thr Glu Leu Glu Glu
1 5 10 15

Phe Tyr Ala Glu Arg Ile Ala Asp Asn Pro Leu Gly Phe Ile Gln Arg
20 25 30

Leu Asp Leu Leu Pro Ser Ile Ser Gly Phe Val Gln Lys Leu Arg Glu
35 40 45

His Gly Gly Glu Phe Phe Glu Met Arg Glu Gly Asn Lys Leu Ile Gly
50 55 60

Ile Cys Gly Leu Asn Pro Ile Asn Gln Thr Glu Ala Glu Leu Cys Lys
65 70 75 80

Phe His Ile Asn Ser Ala Tyr Gln Ser Gln Gly Leu Gly Gln Lys Leu
85 90 95

Tyr Glu Ser Val Glu Lys Tyr Ala Phe Ile Lys Gly Tyr Thr Lys Ile
100 105 110

Ser Leu His Val Ser Lys Ser Gln Ile Lys Ala Cys Asn Leu Tyr Gln
115 120 125

Lys Leu Gly Phe Val His Ile Lys Glu Glu Asp Cys Val Val Glu Leu
130 135 140

Gly Glu Glu Thr Leu Ile Phe Pro Thr Leu Phe Met Glu Lys Ile Leu
145 150 155 160

Ser



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1477 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 374...1267
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:



CGTGGAGTTT TTTAGGCATT TCTTTATATT CATTCAATAA CGCTTGCGCG GGCAATTCTT 60
CAACTAAAAT CTCTACTAAC AATTCATCTG AATGCAAAAT CTCAATTCTC CCTAAAAAAC 120
AAAATCACTT TTAAGACTAA ATCATGTTAG AATTATACTT GAATTTACAC TCAGTTTAGT 180
TTATTTCTTA ATACAAAAGG TAGGCGTTTT GAAACATTTA ACCCCACTCA CTCACACCAT 240
CTTTAAAGCC TTATGGCTAG GCACAGCCTT AAGTGCATCT TTAAGTTTAG CCGCAACAGA 300
AAGCCCCACT AAAACAGAGC CTAAGCCCGC TAAAGGGGTT AAAAACAAGC CCAAATCGCC 360

CGTTACTAAA GTC ATG ATG ACC AAT TGC GAC AAT ATT AAA GAT TTT AAC 409
Met Met Thr Asn Cys Asp Asn Ile Lys Asp Phe Asn
1 5 10

GCT AAG CAA AAA GAA GTC TTA AAA GCC GCT TAT CAA TTC GGC TCT AAA 457
Ala Lys Gln Lys Glu Val Leu Lys Ala Ala Tyr Gln Phe Gly Ser Lys
15 20 25

GAA AAT TTA GGC TAT GAA ATG GCA GGC ATT GCA TGG AAA GAG TCA TGC 505
Glu Asn Leu Gly Tyr Glu Met Ala Gly Ile Ala Trp Lys Glu Ser Cys
30 35 40

GCA GGG GTT TAT AAA ATC AAT TTT TCG GAT CCG AGC GCG GGC GTG TAT 553
Ala Gly Val Tyr Lys Ile Asn Phe Ser Asp Pro Ser Ala Gly Val Tyr
45 50 55 60

CAT TCT TAT ATC CCA AGC GTT CTA AAA AGC TAT GGG CAT AAT GAT AGC 601
His Ser Tyr Ile Pro Ser Val Leu Lys Ser Tyr Gly His Asn Asp Ser
65 70 75

CCC TTT TTG CGT AAT GTG ATG GGG GAA TTG CTC ATT AAA GAC GAT GCG 649
Pro Phe Leu Arg Asn Val Met Gly Glu Leu Leu Ile Lys Asp Asp Ala
80 85 90

TTT GCT TCT GAA GTG GCT TTA AAA GAG TTG CTC TAT TGG AAA ACA CGC 697
Phe Ala Ser Glu Val Ala Leu Lys Glu Leu Leu Tyr Trp Lys Thr Arg
95 100 105

TAC CAT GAC AAT TTA AAA GAC ATG ATT AAA TCT TAC AAC AAG GGC AGT 745
Tyr His Asp Asn Leu Lys Asp Met Ile Lys Ser Tyr Asn Lys Gly Ser
110 115 120

CGT TGG GAA AGG AGC GAA AAA TCT AAC GCT GAT GCT GAA AAA TAT TAC 793
Arg Trp Glu Arg Ser Glu Lys Ser Asn Ala Asp Ala Glu Lys Tyr Tyr
125 130 135 140

GAA GAG ATA CAA GAC AGA ATC AGG CGT TTG AAA GAA TCT AAA ATC TTT 841
Glu Glu Ile Gln Asp Arg Ile Arg Arg Leu Lys Glu Ser Lys Ile Phe
145 150 155

GAT TCG CAG TCT AGT AAT GAC CAA GAA TTG CAA AAA AGC GCT AAT AGC 889
Asp Ser Gln Ser Ser Asn Asp Gln Glu Leu Gln Lys Ser Ala Asn Ser
160 165 170

AAC CTG GAT TTA GAC CCT ATC GGC AAC GCC ATG CCC CAA GCC TTA ATT 937
Asn Leu Asp Leu Asp Pro Ile Gly Asn Ala Met Pro Gln Ala Leu Ile
175 180 185

GCC AAA GAA ACT AAA ATA GAA GAA ACC CAA GCA GAA AAA TCC CAA GAA 985
Ala Lys Glu Thr Lys Ile Glu Glu Thr Gln Ala Glu Lys Ser Gln Glu
190 195 200

ATG AAA GAG ACA ACT AGC GAG CAA ACA AAA AGT AAG CCA GAA AAA GCA 1033
Met Lys Glu Thr Thr Ser Glu Gln Thr Lys Ser Lys Pro Glu Lys Ala
205 210 215 220

AAA GAT AAA CCC ATG TAT TTG GCG CAA ATC AAC AGC ACT GAT TTC ACA 1081
Lys Asp Lys Pro Met Tyr Leu Ala Gln Ile Asn Ser Thr Asp Phe Thr
225 230 235

CCC GTT AAA AAA AGC CCC AAA AAA CCG GCT AAA GTG AGC CAA AAA CAC 1129
Pro Val Lys Lys Ser Pro Lys Lys Pro Ala Lys Val Ser Gln Lys His
240 245 250

TCC TTT AAG AAT AAC ATT AAA AAT AAT GTA AAA AAC AAC GCC AAA ACC 1177
Ser Phe Lys Asn Asn Ile Lys Asn Asn Val Lys Asn Asn Ala Lys Thr
255 260 265

GCT TCC AAA AAA CAA GAA ATG TGC AAA AAT TGC TCT CCA GGG CAA AGG 1225
Ala Ser Lys Lys Gln Glu Met Cys Lys Asn Cys Ser Pro Gly Gln Arg
270 275 280

AAT GCG ATT TTA GCT AAC CAC ATC ACT CTC ATG CAA GAG CTT TAAAAAGTC 1276
Asn Ala Ile Leu Ala Asn His Ile Thr Leu Met Gln Glu Leu
285 290 295

CTAAAAATGG CGCAAAAAAC TCTTTTGATT ATCACTGATG GCATTGGGTA TCGTAAAGAT 1336
AGCGATCATA ACGCTTTCTT CCATGCCAAA AAACCCACTT ATGATTTGAT GTTTAAAACC 1396
TTGCCTTATA GCCTGATTGA TACGCATGGC TTGAGCGTGG GCTTACCTAA GGGGCAAATG 1456
GGAAATTCTG AAGTGGGGCA T 1477

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 298 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Met Met Thr Asn Cys Asp Asn Ile Lys Asp Phe Asn Ala Lys Gln Lys
1 5 10 15

Glu Val Leu Lys Ala Ala Tyr Gln Phe Gly Ser Lys Glu Asn Leu Gly
20 25 30

Tyr Glu Met Ala Gly Ile Ala Trp Lys Glu Ser Cys Ala Gly Val Tyr
35 40 45

Lys Ile Asn Phe Ser Asp Pro Ser Ala Gly Val Tyr His Ser Tyr Ile
50 55 60

Pro Ser Val Leu Lys Ser Tyr Gly His Asn Asp Ser Pro Phe Leu Arg
65 70 75 80

Asn Val Met Gly Glu Leu Leu Ile Lys Asp Asp Ala Phe Ala Ser Glu
85 90 95

Val Ala Leu Lys Glu Leu Leu Tyr Trp Lys Thr Arg Tyr His Asp Asn
100 105 110

Leu Lys Asp Met Ile Lys Ser Tyr Asn Lys Gly Ser Arg Trp Glu Arg
115 120 125

Ser Glu Lys Ser Asn Ala Asp Ala Glu Lys Tyr Tyr Glu Glu Ile Gln
130 135 140

Asp Arg Ile Arg Arg Leu Lys Glu Ser Lys Ile Phe Asp Ser Gln Ser
145 150 155 160

Ser Asn Asp Gln Glu Leu Gln Lys Ser Ala Asn Ser Asn Leu Asp Leu
165 170 175

Asp Pro Ile Gly Asn Ala Met Pro Gln Ala Leu Ile Ala Lys Glu Thr
180 185 190

Lys Ile Glu Glu Thr Gln Ala Glu Lys Ser Gln Glu Met Lys Glu Thr
195 200 205

Thr Ser Glu Gln Thr Lys Ser Lys Pro Glu Lys Ala Lys Asp Lys Pro
210 215 220

Met Tyr Leu Ala Gln Ile Asn Ser Thr Asp Phe Thr Pro Val Lys Lys
225 230 235 240

Ser Pro Lys Lys Pro Ala Lys Val Ser Gln Lys His Ser Phe Lys Asn
245 250 255

Asn Ile Lys Asn Asn Val Lys Asn Asn Ala Lys Thr Ala Ser Lys Lys
260 265 270

Gln Glu Met Cys Lys Asn Cys Ser Pro Gly Gln Arg Asn Ala Ile Leu
275 280 285

Ala Asn His Ile Thr Leu Met Gln Glu Leu
290 295



(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1515 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 141...1340
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

TTAGTGTTGA TTTTTTTATC GTTAGTGTTT GTGCGTCCTT TAGAGGCTTT GAGCGTGTTT 60
ATGGGGTTGT ATTTGATTTA TGGCATCATT CGGTGGCTCT TTTTAATGGT AAAAATTATT 120

TTTAATAAAA ATAAAAGCGC ATG AAA GAA TCT TTT TAC ATA GAG GGA ATG 170
Met Lys Glu Ser Phe Tyr Ile Glu Gly Met
1 5 10

ACT TGC ACG GCG TGT TCT AGC GGG ATT GAA CGC TCT TTG GGG CGT AAG 218
Thr Cys Thr Ala Cys Ser Ser Gly Ile Glu Arg Ser Leu Gly Arg Lys
15 20 25

AGT TTT GTG AAA AAA ATA GAA GTG AGC CTT TTA AAT AAG AGC GCT AAC 266
Ser Phe Val Lys Lys Ile Glu Val Ser Leu Leu Asn Lys Ser Ala Asn
30 35 40

ATT GAA TTT GAC GAA AAC CAA ACC AAT TTA GAC GAA ATT TTT AAA CTC 314
Ile Glu Phe Asp Glu Asn Gln Thr Asn Leu Asp Glu Ile Phe Lys Leu
45 50 55

ATT GAA AAG CTA GGC TAT AGC CCT AAA AAA GCT CTG ACA AAA GAA AAA 362
Ile Glu Lys Leu Gly Tyr Ser Pro Lys Lys Ala Leu Thr Lys Glu Lys
60 65 70

AAA GAA TTT TTT AGC CCT AAT GTT AAA TTA GCG TTA GCG GTT ATT TTC 410
Lys Glu Phe Phe Ser Pro Asn Val Lys Leu Ala Leu Ala Val Ile Phe
75 80 85 90

ACG CTT TTT GTG GTG TAT CTT TCT ATG GGG GCG ATG CTT AGC CCT AGC 458
Thr Leu Phe Val Val Tyr Leu Ser Met Gly Ala Met Leu Ser Pro Ser
95 100 105

CTT TTA CCT GAA AGC TTG CTT GCA ATT GAT AAT CAT AGT AAT TTT TTA 506
Leu Leu Pro Glu Ser Leu Leu Ala Ile Asp Asn His Ser Asn Phe Leu
110 115 120

AAC GCT TGC TTA CAG CTT ATA GGC GCA CTC ATT GTC ATG CAT TTG GGG 554
Asn Ala Cys Leu Gln Leu Ile Gly Ala Leu Ile Val Met His Leu Gly
125 130 135

AGG GAT TTT TAC ATT CAA GGG TTT AAA GCC TTA TGG CAC AGA CAA CCC 602
Arg Asp Phe Tyr Ile Gln Gly Phe Lys Ala Leu Trp His Arg Gln Pro
140 145 150

AAC ATG AGC AGC CTT ATC GCC ATA GGC ACA AGC GCT GCC TTA ATT TCA 650
Asn Met Ser Ser Leu Ile Ala Ile Gly Thr Ser Ala Ala Leu Ile Ser
155 160 165 170

AGC CTG TGG CAA TTG TAT TTG GTC TAT ACC AAT CAT TAT ACC GAT CAG 698
Ser Leu Trp Gln Leu Tyr Leu Val Tyr Thr Asn His Tyr Thr Asp Gln
175 180 185

TGG TCT TAT GGG CAT TAT TAT TTT GAA AGC GTG TGC GTG ATT TTA ATG 746
Trp Ser Tyr Gly His Tyr Tyr Phe Glu Ser Val Cys Val Ile Leu Met
190 195 200

TTT GTG ATG GTG GGC AAA CGC ATT GAA AAT GTT TCT AAA GAC AAA GCT 794
Phe Val Met Val Gly Lys Arg Ile Glu Asn Val Ser Lys Asp Lys Ala
205 210 215

TTA GAC GCT ATG CAA GCC TTG ATG AAA AAC GCC CCA AAA ACC GCC CTT 842
Leu Asp Ala Met Gln Ala Leu Met Lys Asn Ala Pro Lys Thr Ala Leu
220 225 230

AAA ATG CAA AAT AAC CAA CAG ATT GAA GTT TTA GTG GAT AGC ATT GTG 890
Lys Met Gln Asn Asn Gln Gln Ile Glu Val Leu Val Asp Ser Ile Val
235 240 245 250

GTG GGG GAT ATT CTA AAA GTC CTC CCT GGA AGC GCG ATT GCG GTG GAT 938
Val Gly Asp Ile Leu Lys Val Leu Pro Gly Ser Ala Ile Ala Val Asp
255 260 265

GGT GAA ATC ATA GAG GGC GAA GGG GAA TTA GAT GAG AGC ATG TTG AGC 986
Gly Glu Ile Ile Glu Gly Glu Gly Glu Leu Asp Glu Ser Met Leu Ser
270 275 280

GGC GAA GCG TTG CCG GTT TAT AAA AAA GTC GGC GAT AAA GTC TTT TCA 1034
Gly Glu Ala Leu Pro Val Tyr Lys Lys Val Gly Asp Lys Val Phe Ser
285 290 295

GGG ACA TTC AAT AGC CAC ACG AGT TTT TTA ATG AAA GCC ACG CAA AAC 1082
Gly Thr Phe Asn Ser His Thr Ser Phe Leu Met Lys Ala Thr Gln Asn
300 305 310

AAC AAA AAC AGC ACC TTG TCT CAA ATT ATA GAA ATG ATT TAT AAC GCT 1130
Asn Lys Asn Ser Thr Leu Ser Gln Ile Ile Glu Met Ile Tyr Asn Ala
315 320 325 330

CAA AGT TCA AAG GCA GAG ATT TCT CGC TTA GCG GAT AAG GTT TCA AGC 1178
Gln Ser Ser Lys Ala Glu Ile Ser Arg Leu Ala Asp Lys Val Ser Ser
335 340 345

GTG TTT GTG CCA AGC GTG ATC GCT ATT TCT ATT TTA GCG TTT GTG GTG 1226
Val Phe Val Pro Ser Val Ile Ala Ile Ser Ile Leu Ala Phe Val Val
350 355 360

TGG CTC ATC ATT GCA CCT AAG CCC GAT TTT TGG TGG AAT TTT GGA ATC 1274
Trp Leu Ile Ile Ala Pro Lys Pro Asp Phe Trp Trp Asn Phe Gly Ile
365 370 375

GCT TTA GAA GTG TTT GTA TCG GTT TTA GTG ATT TCT TGC CCT TGC GCT 1322
Ala Leu Glu Val Phe Val Ser Val Leu Val Ile Ser Cys Pro Cys Ala
380 385 390

TTA GGA TTG CTA CGC CTA TGAGCATTTT AGTAGCGAAC CAGAAAGCGA GTTCTTTA 1378
Leu Gly Leu Leu Arg Leu
395 400

GGGTTATTTT TTAAAGACGC TAAAAGTTTA GAAAAAGCAA GGCTAGTCAA TACGATCGTT 1438
TTTGATAAAA CCGGCACGCT CACTAACGGC AAGCCTGTCG TTAAAAGCGT TCATTCTAAG 1498
ATAGAATTAT TAGAGTT 1515



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 400 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Met Lys Glu Ser Phe Tyr Ile Glu Gly Met Thr Cys Thr Ala Cys Ser
1 5 10 15

Ser Gly Ile Glu Arg Ser Leu Gly Arg Lys Ser Phe Val Lys Lys Ile
20 25 30

Glu Val Ser Leu Leu Asn Lys Ser Ala Asn Ile Glu Phe Asp Glu Asn
35 40 45

Gln Thr Asn Leu Asp Glu Ile Phe Lys Leu Ile Glu Lys Leu Gly Tyr
50 55 60

Ser Pro Lys Lys Ala Leu Thr Lys Glu Lys Lys Glu Phe Phe Ser Pro
65 70 75 80

Asn Val Lys Leu Ala Leu Ala Val Ile Phe Thr Leu Phe Val Val Tyr
85 90 95

Leu Ser Met Gly Ala Met Leu Ser Pro Ser Leu Leu Pro Glu Ser Leu
100 105 110

Leu Ala Ile Asp Asn His Ser Asn Phe Leu Asn Ala Cys Leu Gln Leu
115 120 125

Ile Gly Ala Leu Ile Val Met His Leu Gly Arg Asp Phe Tyr Ile Gln
130 135 140

Gly Phe Lys Ala Leu Trp His Arg Gln Pro Asn Met Ser Ser Leu Ile
145 150 155 160

Ala Ile Gly Thr Ser Ala Ala Leu Ile Ser Ser Leu Trp Gln Leu Tyr
165 170 175

Leu Val Tyr Thr Asn His Tyr Thr Asp Gln Trp Ser Tyr Gly His Tyr
180 185 190

Tyr Phe Glu Ser Val Cys Val Ile Leu Met Phe Val Met Val Gly Lys
195 200 205

Arg Ile Glu Asn Val Ser Lys Asp Lys Ala Leu Asp Ala Met Gln Ala
210 215 220

Leu Met Lys Asn Ala Pro Lys Thr Ala Leu Lys Met Gln Asn Asn Gln
225 230 235 240

Gln Ile Glu Val Leu Val Asp Ser Ile Val Val Gly Asp Ile Leu Lys
245 250 255

Val Leu Pro Gly Ser Ala Ile Ala Val Asp Gly Glu Ile Ile Glu Gly
260 265 270

Glu Gly Glu Leu Asp Glu Ser Met Leu Ser Gly Glu Ala Leu Pro Val
275 280 285

Tyr Lys Lys Val Gly Asp Lys Val Phe Ser Gly Thr Phe Asn Ser His
290 295 300

Thr Ser Phe Leu Met Lys Ala Thr Gln Asn Asn Lys Asn Ser Thr Leu
305 310 315 320

Ser Gln Ile Ile Glu Met Ile Tyr Asn Ala Gln Ser Ser Lys Ala Glu
325 330 335

Ile Ser Arg Leu Ala Asp Lys Val Ser Ser Val Phe Val Pro Ser Val
340 345 350

Ile Ala Ile Ser Ile Leu Ala Phe Val Val Trp Leu Ile Ile Ala Pro
355 360 365

Lys Pro Asp Phe Trp Trp Asn Phe Gly Ile Ala Leu Glu Val Phe Val
370 375 380

Ser Val Leu Val Ile Ser Cys Pro Cys Ala Leu Gly Leu Leu Arg Leu
385 390 395 400

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1443 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 76...1389
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

ACTTTAAAAA ACCCCCTTAA AAAGGTTTTT AGGTATAATT AGCGATCTTT TAGTTTCAAA 60

TAGTAGAGAG ATGGG ATG AAA AAA ATA TGG CTT TTA GTG TGG GGC TTG TGT 111
Met Lys Lys Ile Trp Leu Leu Val Trp Gly Leu Cys
1 5 10

TCT TGG GTG TTT TTG CAT GCG ATA GAG ATG ATA GAA AAA GCC CCT ACA 159
Ser Trp Val Phe Leu His Ala Ile Glu Met Ile Glu Lys Ala Pro Thr
15 20 25

AAT GTA GAG GAT AGA GAC AAA GCC CCC CAT TTG TTG CTT TTA GCA GGG 207
Asn Val Glu Asp Arg Asp Lys Ala Pro His Leu Leu Leu Leu Ala Gly
30 35 40

ATT CAA GGC GAT GAG CCT GGT GGG TTT AAT GCA ACT AAT TTG TTT TTA 255
Ile Gln Gly Asp Glu Pro Gly Gly Phe Asn Ala Thr Asn Leu Phe Leu
45 50 55 60

ATG CAT TAT AGC GTT TTA AAA GGT TTG GTT GAA GTG GTT CCT GTA TTG 303
Met His Tyr Ser Val Leu Lys Gly Leu Val Glu Val Val Pro Val Leu
65 70 75

AAT AAG CCT TCC ATG TTA AGA AAT CAT AGG GGC TTG TAT GGG GAT ATG 351
Asn Lys Pro Ser Met Leu Arg Asn His Arg Gly Leu Tyr Gly Asp Met
80 85 90

AAC CGC AAA TTT GCC GCT TTA GAC AAG AAT GAC CCT GAA TAC CCC ACT 399
Asn Arg Lys Phe Ala Ala Leu Asp Lys Asn Asp Pro Glu Tyr Pro Thr
95 100 105

ATC CAG GAA ATC AAA TCC TTG ATT GCA AAA CCC AGT ATA GAC GCT GTC 447
Ile Gln Glu Ile Lys Ser Leu Ile Ala Lys Pro Ser Ile Asp Ala Val
110 115 120

TTG CAT TTG CAT GAT GGC GGT GGG TAT TAC CGC CCT GTT TAT GTT GAT 495
Leu His Leu His Asp Gly Gly Gly Tyr Tyr Arg Pro Val Tyr Val Asp
125 130 135 140

GCG ATG CTC AAT CCT AAG CGC TGG GGG AAT TGC TTT ATT ATT GAT CAA 543
Ala Met Leu Asn Pro Lys Arg Trp Gly Asn Cys Phe Ile Ile Asp Gln
145 150 155

GAT GAG GTT AAA GGG GCG AAA TTC CCT AAT TTG CTT GCT TTT GCA AAC 591
Asp Glu Val Lys Gly Ala Lys Phe Pro Asn Leu Leu Ala Phe Ala Asn
160 165 170

AAT ACG ATT GAG AGT ATC AAC GCC CAT TTA TTG CAC CCC ATT GAA GAG 639
Asn Thr Ile Glu Ser Ile Asn Ala His Leu Leu His Pro Ile Glu Glu
175 180 185

TAT CAT TTA AAA AAC ACG CGC ACC GCG CAA GGC GAT ACA GAA ATG CAA 687
Tyr His Leu Lys Asn Thr Arg Thr Ala Gln Gly Asp Thr Glu Met Gln
190 195 200

AAA GCC CTA ACT TTT TAT GCG ATC AAC CAA AAA AAG AGC GCT TTT GCC 735
Lys Ala Leu Thr Phe Tyr Ala Ile Asn Gln Lys Lys Ser Ala Phe Ala
205 210 215 220

AAT GAA GCT AGC AAA GAA CTC CCT TTA GCA TCA AGG GTG TTT TAC CAC 783
Asn Glu Ala Ser Lys Glu Leu Pro Leu Ala Ser Arg Val Phe Tyr His
225 230 235

CTG CAA GCC ATT GAG GGC TTA CTC AAT CAG CTC AAT ATC CCT TTT AAG 831
Leu Gln Ala Ile Glu Gly Leu Leu Asn Gln Leu Asn Ile Pro Phe Lys
240 245 250

CGC GAT TTT GAT CTT AAC CCT AAC AGC GTG CAT GCC CTA ATC AAT GAT 879
Arg Asp Phe Asp Leu Asn Pro Asn Ser Val His Ala Leu Ile Asn Asp
255 260 265

AAA AAC TTG TGG GCA AAA ATC AGC TCT TTG CCT AAA ATG CCC CTT TTT 927
Lys Asn Leu Trp Ala Lys Ile Ser Ser Leu Pro Lys Met Pro Leu Phe
270 275 280

AAC TTG CGC CCT AAA CTC AAT CAT TTC CCC TTA CCC CAC AAC ACT AAA 975
Asn Leu Arg Pro Lys Leu Asn His Phe Pro Leu Pro His Asn Thr Lys
285 290 295 300

ATC CCA CAA ATC CCC ATA GAG AGC AAC GCT TAC ATT GTA GGG CTA GTC 1023
Ile Pro Gln Ile Pro Ile Glu Ser Asn Ala Tyr Ile Val Gly Leu Val
305 310 315

AAA AAT AAA CAA GAA GTG TTT TTA AAA TAC GGC AAC AAG CTC ATG ACA 1071
Lys Asn Lys Gln Glu Val Phe Leu Lys Tyr Gly Asn Lys Leu Met Thr
320 325 330

CGA TTA TCG CCT TTT TAC ATA GAG TTT GAT CCT TCT TTA GAA GAA GTG 1119
Arg Leu Ser Pro Phe Tyr Ile Glu Phe Asp Pro Ser Leu Glu Glu Val
335 340 345

AAA ATG CAA ATT GAC AAT AAG GAT CAA ATG GTT AAA ATA GGG AGC GTG 1167
Lys Met Gln Ile Asp Asn Lys Asp Gln Met Val Lys Ile Gly Ser Val
350 355 360

GTT GAA GTG AAA GAG AGT TTT TAT ATC CAT GCT ATG GAC AAT ATC CGT 1215
Val Glu Val Lys Glu Ser Phe Tyr Ile His Ala Met Asp Asn Ile Arg
365 370 375 380




GCG AAT GTG ATT GGC TTT AGC GTT TCT AAT GAA AAT AAG CCT AAT GAA 1263
Ala Asn Val Ile Gly Phe Ser Val Ser Asn Glu Asn Lys Pro Asn Glu
385 390 395

GCG GGT TAT ACG ATT AAA TTT AAA GAT TTT CAA AAA CGC TTT TCA TTG 1311
Ala Gly Tyr Thr Ile Lys Phe Lys Asp Phe Gln Lys Arg Phe Ser Leu
400 405 410

GAC AAG CAA GAA AGG ATC TAT CGC ATA GAA TTT TAT AAA AAC AAC GCG 1359
Asp Lys Gln Glu Arg Ile Tyr Arg Ile Glu Phe Tyr Lys Asn Asn Ala
415 420 425

TTT AGC GGG ATG ATC TTA GTG AAA TTT GTG TAGGAATGGA TAAATCTCAT TGC 1412
Phe Ser Gly Met Ile Leu Val Lys Phe Val
430 435

CTTTTAACAT TCAAGGGTTT TGGTATTTTT T 1443

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 438 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Met Lys Lys Ile Trp Leu Leu Val Trp Gly Leu Cys Ser Trp Val Phe
1 5 10 15

Leu His Ala Ile Glu Met Ile Glu Lys Ala Pro Thr Asn Val Glu Asp
20 25 30

Arg Asp Lys Ala Pro His Leu Leu Leu Leu Ala Gly Ile Gln Gly Asp
35 40 45

Glu Pro Gly Gly Phe Asn Ala Thr Asn Leu Phe Leu Met His Tyr Ser
50 55 60

Val Leu Lys Gly Leu Val Glu Val Val Pro Val Leu Asn Lys Pro Ser
65 70 75 80

Met Leu Arg Asn His Arg Gly Leu Tyr Gly Asp Met Asn Arg Lys Phe
85 90 95

Ala Ala Leu Asp Lys Asn Asp Pro Glu Tyr Pro Thr Ile Gln Glu Ile
100 105 110

Lys Ser Leu Ile Ala Lys Pro Ser Ile Asp Ala Val Leu His Leu His
115 120 125

Asp Gly Gly Gly Tyr Tyr Arg Pro Val Tyr Val Asp Ala Met Leu Asn
130 135 140

Pro Lys Arg Trp Gly Asn Cys Phe Ile Ile Asp Gln Asp Glu Val Lys
145 150 155 160

Gly Ala Lys Phe Pro Asn Leu Leu Ala Phe Ala Asn Asn Thr Ile Glu
165 170 175

Ser Ile Asn Ala His Leu Leu His Pro Ile Glu Glu Tyr His Leu Lys
180 185 190

Asn Thr Arg Thr Ala Gln Gly Asp Thr Glu Met Gln Lys Ala Leu Thr
195 200 205

Phe Tyr Ala Ile Asn Gln Lys Lys Ser Ala Phe Ala Asn Glu Ala Ser
210 215 220

Lys Glu Leu Pro Leu Ala Ser Arg Val Phe Tyr His Leu Gln Ala Ile
225 230 235 240

Glu Gly Leu Leu Asn Gln Leu Asn Ile Pro Phe Lys Arg Asp Phe Asp
245 250 255

Leu Asn Pro Asn Ser Val His Ala Leu Ile Asn Asp Lys Asn Leu Trp
260 265 270

Ala Lys Ile Ser Ser Leu Pro Lys Met Pro Leu Phe Asn Leu Arg Pro
275 280 285

Lys Leu Asn His Phe Pro Leu Pro His Asn Thr Lys Ile Pro Gln Ile
290 295 300

Pro Ile Glu Ser Asn Ala Tyr Ile Val Gly Leu Val Lys Asn Lys Gln
305 310 315 320

Glu Val Phe Leu Lys Tyr Gly Asn Lys Leu Met Thr Arg Leu Ser Pro
325 330 335

Phe Tyr Ile Glu Phe Asp Pro Ser Leu Glu Glu Val Lys Met Gln Ile
340 345 350

Asp Asn Lys Asp Gln Met Val Lys Ile Gly Ser Val Val Glu Val Lys
355 360 365

Glu Ser Phe Tyr Ile His Ala Met Asp Asn Ile Arg Ala Asn Val Ile
370 375 380

Gly Phe Ser Val Ser Asn Glu Asn Lys Pro Asn Glu Ala Gly Tyr Thr
385 390 395 400

Ile Lys Phe Lys Asp Phe Gln Lys Arg Phe Ser Leu Asp Lys Gln Glu
405 410 415

Arg Ile Tyr Arg Ile Glu Phe Tyr Lys Asn Asn Ala Phe Ser Gly Met
420 425 430

Ile Leu Val Lys Phe Val
435

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1280 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 66...1223
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

ATCAATACCC CTTAAATAAA AGATATAATG CTGTATTATA AGCTAGTTTT AATTACAATT 60

TTCAA ATG TTA AGG AAA AAC ATT TTA GCT TAC TAT GGG GCG AAT TTT CTC 110
Met Leu Arg Lys Asn Ile Leu Ala Tyr Tyr Gly Ala Asn Phe Leu
1 5 10 15

TTA ATC ATC GCT CAA AGC TTA CCC CAT GCG ATT TTA ACC CCC TTG TTG 158
Leu Ile Ile Ala Gln Ser Leu Pro His Ala Ile Leu Thr Pro Leu Leu
20 25 30

CTT TCT AAA GGG CTT AGT TTG AGT GAA ATC TTG CTC GTG CAA ACC TTT 206
Leu Ser Lys Gly Leu Ser Leu Ser Glu Ile Leu Leu Val Gln Thr Phe
35 40 45

TTT AGC TTT TGC GTG CTA GTG GCT GAA TAC CCA AGC GGC GTT TTA GCG 254
Phe Ser Phe Cys Val Leu Val Ala Glu Tyr Pro Ser Gly Val Leu Ala
50 55 60

GAT TTG ATG AGC CGA AAA AAT TTA TTC CTG GTT TCT AAT GCC TTT TTA 302
Asp Leu Met Ser Arg Lys Asn Leu Phe Leu Val Ser Asn Ala Phe Leu
65 70 75

ATC GCT AGT TTT TCG TTT GTG CTG TTT TTT GAT AGC TTT ATT TTC ATG 350
Ile Ala Ser Phe Ser Phe Val Leu Phe Phe Asp Ser Phe Ile Phe Met
80 85 90 95

CTT TTA GCG TGG GGG TTG TAT GGT TTG TAT AGC GCA TGC TCT AGC GGC 398
Leu Leu Ala Trp Gly Leu Tyr Gly Leu Tyr Ser Ala Cys Ser Ser Gly
100 105 110

ACG ATT GAA GCT TCA CTC ATC ACA GAC ATT AAG GAA AAC AAA AAA GAT 446
Thr Ile Glu Ala Ser Leu Ile Thr Asp Ile Lys Glu Asn Lys Lys Asp
115 120 125

TTA TCC AAG TTT TTA GCC AAA AAC AAT CAA ATT ACT TAT TTA GGC ATG 494
Leu Ser Lys Phe Leu Ala Lys Asn Asn Gln Ile Thr Tyr Leu Gly Met
130 135 140

ATT ATA GGG AGT TCT TTG GGA TCG TTT TTG TAT CTC AAA GTC CAT GCG 542
Ile Ile Gly Ser Ser Leu Gly Ser Phe Leu Tyr Leu Lys Val His Ala
145 150 155

ATG CTG TAT ATT GTG GGG ATT TTT TTA ATC ATG CTC TGT GTG CTA ACG 590
Met Leu Tyr Ile Val Gly Ile Phe Leu Ile Met Leu Cys Val Leu Thr
160 165 170 175

ATC ATT TTT TAT TTT AAA GAG AAA GAA GGG GAT TTT AAA AGC CAA AAA 638
Ile Ile Phe Tyr Phe Lys Glu Lys Glu Gly Asp Phe Lys Ser Gln Lys
180 185 190

AGC CTG AAA CTC CTT AAA GAG CAA GTC AAA GGC AGT CTT AAA GAG CTT 686
Ser Leu Lys Leu Leu Lys Glu Gln Val Lys Gly Ser Leu Lys Glu Leu
195 200 205

AAA GAT AAC CCC AAA CTT AAA ATT CTG TTA GTG GGG CAT TTG ATT ACG 734
Lys Asp Asn Pro Lys Leu Lys Ile Leu Leu Val Gly His Leu Ile Thr
210 215 220

CCC GTC TTT TTT ATG AGC CAT TTT CAA ATG TGG CAA GCG TAT TTT TTA 782
Pro Val Phe Phe Met Ser His Phe Gln Met Trp Gln Ala Tyr Phe Leu
225 230 235

AAA CAA GGC GTT AAA GAG CAA TAC CTT TTT GTG TTT TAT ATC GCT TTT 830
Lys Gln Gly Val Lys Glu Gln Tyr Leu Phe Val Phe Tyr Ile Ala Phe
240 245 250 255

CAA GTG ATT TCT ATT CTC ATT CAT TTT TTA AAA GCC TCT AGT TAT AGC 878
Gln Val Ile Ser Ile Leu Ile His Phe Leu Lys Ala Ser Ser Tyr Ser
260 265 270

CAA AAA ATC GCC TTG AGT TCG CTT GTG GTG TTG TTA GGC GTT AGC CCC 926
Gln Lys Ile Ala Leu Ser Ser Leu Val Val Leu Leu Gly Val Ser Pro
275 280 285

TTA TTG CTT AGC AAT ATC CCT TAT TGT TTC ATA GGG GTG TAT GCG CTC 974
Leu Leu Leu Ser Asn Ile Pro Tyr Cys Phe Ile Gly Val Tyr Ala Leu
290 295 300

ATG GTG GCG TTT TTC ACT TAC ATG AGC TAT TGC TTA AAC TAT CAA TTC 1022
Met Val Ala Phe Phe Thr Tyr Met Ser Tyr Cys Leu Asn Tyr Gln Phe
305 310 315

TCC AAA TTC GTT TCT AAA AAC AAC ATT TCC TCG CTC TCA TCG CTT TTA 1070
Ser Lys Phe Val Ser Lys Asn Asn Ile Ser Ser Leu Ser Ser Leu Leu
320 325 330 335

TCA AGC TGT GTG CGC GTG GTC TCT GTG CTA ATC TTA TCG CTC AGC AGT 1118
Ser Ser Cys Val Arg Val Val Ser Val Leu Ile Leu Ser Leu Ser Ser
340 345 350

CTG GAA CTG CGT TAC TTC TCA CCC CTA ACT ATC ATA ACC ATG CAT TTT 1166
Leu Glu Leu Arg Tyr Phe Ser Pro Leu Thr Ile Ile Thr Met His Phe
355 360 365

GCC TTG ACG CTT ATC ATC CTC TTT TTC TTT TTG TAT AAG GCT AAG CCG 1214
Ala Leu Thr Leu Ile Ile Leu Phe Phe Phe Leu Tyr Lys Ala Lys Pro
370 375 380

TTT GAT GAG TGAGCGGCTT TAAGAGTGCA ACCTTTTAGC GATTTCTATA GCAACATCA 1272
Phe Asp Glu
385

TAGCCATG 1280

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Met Leu Arg Lys Asn Ile Leu Ala Tyr Tyr Gly Ala Asn Phe Leu Leu
1 5 10 15

Ile Ile Ala Gln Ser Leu Pro His Ala Ile Leu Thr Pro Leu Leu Leu
20 25 30

Ser Lys Gly Leu Ser Leu Ser Glu Ile Leu Leu Val Gln Thr Phe Phe
35 40 45

Ser Phe Cys Val Leu Val Ala Glu Tyr Pro Ser Gly Val Leu Ala Asp
50 55 60

Leu Met Ser Arg Lys Asn Leu Phe Leu Val Ser Asn Ala Phe Leu Ile
65 70 75 80

Ala Ser Phe Ser Phe Val Leu Phe Phe Asp Ser Phe Ile Phe Met Leu
85 90 95

Leu Ala Trp Gly Leu Tyr Gly Leu Tyr Ser Ala Cys Ser Ser Gly Thr
100 105 110

Ile Glu Ala Ser Leu Ile Thr Asp Ile Lys Glu Asn Lys Lys Asp Leu
115 120 125

Ser Lys Phe Leu Ala Lys Asn Asn Gln Ile Thr Tyr Leu Gly Met Ile
130 135 140

Ile Gly Ser Ser Leu Gly Ser Phe Leu Tyr Leu Lys Val His Ala Met
145 150 155 160

Leu Tyr Ile Val Gly Ile Phe Leu Ile Met Leu Cys Val Leu Thr Ile
165 170 175

Ile Phe Tyr Phe Lys Glu Lys Glu Gly Asp Phe Lys Ser Gln Lys Ser
180 185 190

Leu Lys Leu Leu Lys Glu Gln Val Lys Gly Ser Leu Lys Glu Leu Lys
195 200 205

Asp Asn Pro Lys Leu Lys Ile Leu Leu Val Gly His Leu Ile Thr Pro
210 215 220

Val Phe Phe Met Ser His Phe Gln Met Trp Gln Ala Tyr Phe Leu Lys
225 230 235 240

Gln Gly Val Lys Glu Gln Tyr Leu Phe Val Phe Tyr Ile Ala Phe Gln
245 250 255

Val Ile Ser Ile Leu Ile His Phe Leu Lys Ala Ser Ser Tyr Ser Gln
260 265 270

Lys Ile Ala Leu Ser Ser Leu Val Val Leu Leu Gly Val Ser Pro Leu
275 280 285

Leu Leu Ser Asn Ile Pro Tyr Cys Phe Ile Gly Val Tyr Ala Leu Met
290 295 300

Val Ala Phe Phe Thr Tyr Met Ser Tyr Cys Leu Asn Tyr Gln Phe Ser
305 310 315 320

Lys Phe Val Ser Lys Asn Asn Ile Ser Ser Leu Ser Ser Leu Leu Ser
325 330 335

Ser Cys Val Arg Val Val Ser Val Leu Ile Leu Ser Leu Ser Ser Leu
340 345 350

Glu Leu Arg Tyr Phe Ser Pro Leu Thr Ile Ile Thr Met His Phe Ala
355 360 365

Leu Thr Leu Ile Ile Leu Phe Phe Phe Leu Tyr Lys Ala Lys Pro Phe
370 375 380

Asp Glu
385 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1264 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1205
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

ATTAAATATG ACTATATACA CTACAACAAT AAGATTTTGA AAGGTTGGTA ATG GAA 56
Met Glu
1

TCA GTA AAA ACA GGA AAA ACA AAT AAG GTT GGC AAG AAT ACA GAG ATG 104
Ser Val Lys Thr Gly Lys Thr Asn Lys Val Gly Lys Asn Thr Glu Met
5 10 15

GCT AAT ACA AAG GCA AAT AAA GAG GCT CAT TTT AAA CAA GCG AGC ACC 152
Ala Asn Thr Lys Ala Asn Lys Glu Ala His Phe Lys Gln Ala Ser Thr
20 25 30

ATT ACA AAT ATA ATC AGA TCA ATT CGT GGG ATT TTT ACA AAA ATT GCA 200
Ile Thr Asn Ile Ile Arg Ser Ile Arg Gly Ile Phe Thr Lys Ile Ala
35 40 45 50

AAG AAA GTT AGA GGA CTT GTA AAA AAA CAC CCC AAG AAA AGC AGT GCG 248
Lys Lys Val Arg Gly Leu Val Lys Lys His Pro Lys Lys Ser Ser Ala
55 60 65

GCA TTA GTA GTA TTG ACC CAT ATT GCG TGC AAG AAA GCG AAA GAA TTA 296
Ala Leu Val Val Leu Thr His Ile Ala Cys Lys Lys Ala Lys Glu Leu
70 75 80

GAC GAT AAA GTC CAA GAT AAA TCC AAA CAA GCT GAA AAA GAA AAT CAA 344
Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu Lys Glu Asn Gln
85 90 95

ATC AAT TGG TGG AAA TAT TCA GGA TTA ACA ATA GCG ACA AGT TTA TTA 392
Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala Thr Ser Leu Leu
100 105 110

TTA GCC GCT TGT AGC ACT GGT GAT GTT AGT GAA CAA ATA GAA CTA GAA 440
Leu Ala Ala Cys Ser Thr Gly Asp Val Ser Glu Gln Ile Glu Leu Glu
115 120 125 130

CAA GAA AAA CAA AAG ACG AGC AAT ATA GAG ACT AAC AAT CAA ATA AAA 488
Gln Glu Lys Gln Lys Thr Ser Asn Ile Glu Thr Asn Asn Gln Ile Lys
135 140 145

GTA GAA CAA GAA AAA CAA AAG ACA AGC AAT ATA GAG ACT AAT AAT CAA 536
Val Glu Gln Glu Lys Gln Lys Thr Ser Asn Ile Glu Thr Asn Asn Gln
150 155 160

ATA AAA GTA GAA CAA GAA CAA CAG AAA ACA GAA CAA GAA MGA CAG AAA 584
Ile Lys Val Glu Gln Glu Gln Gln Lys Thr Glu Gln Glu Xaa Gln Lys
165 170 175

ACA GAA CAA GAA AGA CAG AAG ACA GAA CAA GAA AAA CAA AAG ACC ATT 632
Thr Glu Gln Glu Arg Gln Lys Thr Glu Gln Glu Lys Gln Lys Thr Ile
180 185 190

AAA ACA CAG AAA GAT TTC ATT AAA TAT GTA GAA CAA AAT TGC CAA GAA 680
Lys Thr Gln Lys Asp Phe Ile Lys Tyr Val Glu Gln Asn Cys Gln Glu
195 200 205 210

AAT CAT AAT CAA TTC TTT ATT GAA AAA GGA GGA ATT AAG GCT GGT ATT 728
Asn His Asn Gln Phe Phe Ile Glu Lys Gly Gly Ile Lys Ala Gly Ile
215 220 225

GGT ATA GAA GTA GAA GCT GAA TGC AAA ACC CCT AAA CCT GCA AAA ACC 776
Gly Ile Glu Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr
230 235 240

AAT CAA ACC CCT ATC CAG CCA AAA CAC CTC CCA AAC TCT AAA CAA CCC 824
Asn Gln Thr Pro Ile Gln Pro Lys His Leu Pro Asn Ser Lys Gln Pro
245 250 255

CGC TCT CAA AGA GGA TCA AAA GCG CAA GAG CTT ATC GCT TAT TTG CAA 872
Arg Ser Gln Arg Gly Ser Lys Ala Gln Glu Leu Ile Ala Tyr Leu Gln
260 265 270

AAA GAG CTA GAA TCT CTG CCC TAT TCA CAA AAA GCT ATC GCT AAA CAA 920
Lys Glu Leu Glu Ser Leu Pro Tyr Ser Gln Lys Ala Ile Ala Lys Gln
275 280 285 290

GTG GAT TTT TAT AGA CCA AGT TCT ATC GCT TAT TTA GAA CTA GAC CCT 968
Val Asp Phe Tyr Arg Pro Ser Ser Ile Ala Tyr Leu Glu Leu Asp Pro
295 300 305

AGA GAT TTT AAT GTT ACA GAA GAA TGG CAA AAA GAA AAT TTA AAA ATA 1016
Arg Asp Phe Asn Val Thr Glu Glu Trp Gln Lys Glu Asn Leu Lys Ile
310 315 320

CGC TCT AAA GCT CAA GCT AAA ATG CTT GAA ATG AGG AGT TTA AAA CCA 1064
Arg Ser Lys Ala Gln Ala Lys Met Leu Glu Met Arg Ser Leu Lys Pro
325 330 335

GAC TCA CAA GCC CAC CTT TCA ACC TCT CAA AGC CTT TTG TTC GTT CAA 1112
Asp Ser Gln Ala His Leu Ser Thr Ser Gln Ser Leu Leu Phe Val Gln
340 345 350

AAA ATA TTT GCT GAT GTT AAT AAA GAA ATA AAA GTA GTT GCT AAT ACT 1160
Lys Ile Phe Ala Asp Val Asn Lys Glu Ile Lys Val Val Ala Asn Thr
355 360 365 370

GAA AAG AAA GCA GAA AAA GCG GGT TAT GGT TAT AGT AAA AGG ATG TAGGC 1210
Glu Lys Lys Ala Glu Lys Ala Gly Tyr Gly Tyr Ser Lys Arg Met
375 380 385

ATAAGAAAAC ACCATAAAAT CGTTCTTAGC TTATTTATAG TATTTTAAAA ACTC 1264

(2) INFORMATION FOR SEQ ID NO: 34:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 385 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:



Met Glu Ser Val Lys Thr Gly Lys Thr Asn Lys Val Gly Lys Asn Thr
1 5 10 15

Glu Met Ala Asn Thr Lys Ala Asn Lys Glu Ala His Phe Lys Gln Ala
20 25 30

Ser Thr Ile Thr Asn Ile Ile Arg Ser Ile Arg Gly Ile Phe Thr Lys
35 40 45

Ile Ala Lys Lys Val Arg Gly Leu Val Lys Lys His Pro Lys Lys Ser
50 55 60

Ser Ala Ala Leu Val Val Leu Thr His Ile Ala Cys Lys Lys Ala Lys
65 70 75 80

Glu Leu Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu Lys Glu
85 90 95

Asn Gln Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala Thr Ser
100 105 110

Leu Leu Leu Ala Ala Cys Ser Thr Gly Asp Val Ser Glu Gln Ile Glu
115 120 125

Leu Glu Gln Glu Lys Gln Lys Thr Ser Asn Ile Glu Thr Asn Asn Gln
130 135 140

Ile Lys Val Glu Gln Glu Lys Gln Lys Thr Ser Asn Ile Glu Thr Asn
145 150 155 160

Asn Gln Ile Lys Val Glu Gln Glu Gln Gln Lys Thr Glu Gln Glu Xaa
165 170 175

Gln Lys Thr Glu Gln Glu Arg Gln Lys Thr Glu Gln Glu Lys Gln Lys
180 185 190

Thr Ile Lys Thr Gln Lys Asp Phe Ile Lys Tyr Val Glu Gln Asn Cys
195 200 205

Gln Glu Asn His Asn Gln Phe Phe Ile Glu Lys Gly Gly Ile Lys Ala
210 215 220

Gly Ile Gly Ile Glu Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala
225 230 235 240

Lys Thr Asn Gln Thr Pro Ile Gln Pro Lys His Leu Pro Asn Ser Lys
245 250 255

Gln Pro Arg Ser Gln Arg Gly Ser Lys Ala Gln Glu Leu Ile Ala Tyr
260 265 270

Leu Gln Lys Glu Leu Glu Ser Leu Pro Tyr Ser Gln Lys Ala Ile Ala
275 280 285

Lys Gln Val Asp Phe Tyr Arg Pro Ser Ser Ile Ala Tyr Leu Glu Leu
290 295 300

Asp Pro Arg Asp Phe Asn Val Thr Glu Glu Trp Gln Lys Glu Asn Leu
305 310 315 320

Lys Ile Arg Ser Lys Ala Gln Ala Lys Met Leu Glu Met Arg Ser Leu
325 330 335

Lys Pro Asp Ser Gln Ala His Leu Ser Thr Ser Gln Ser Leu Leu Phe
340 345 350

Val Gln Lys Ile Phe Ala Asp Val Asn Lys Glu Ile Lys Val Val Ala
355 360 365

Asn Thr Glu Lys Lys Ala Glu Lys Ala Gly Tyr Gly Tyr Ser Lys Arg
370 375 380

Met
385

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 62...340
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

ATTCATTTAC TTTTGAGAAA TATAATTCTC TCGCTTTTAA GATCATCACA AGGAGTTTCG 60 

T ATG AAA AAG CAA ATC TTG ACA GGT GTT TTA TTA TCA GTT TTG GCA GTG 109
Met Lys Lys Gln Ile Leu Thr Gly Val Leu Leu Ser Val Leu Ala Val
1 5 10 15

AGT TCT GCA TAG GCT CAC AAA GAT AAA AAA GAC GCC AAA AAA CCT AAA 157
Ser Ser Ala Tyr Ala His Lys Asp Lys Lys Asp Ala Lys Lys Pro Lys
20 25 30

TTT AGC ACA GAA TTA GTC GTG GCT CAA AAC GAC AAA AAA GAC GCT AAA 205
Phe Ser Thr Glu Leu Val Val Ala Gln Asn Asp Lys Lys Asp Ala Lys
35 40 45

AAA CCT AAA TTT AGC ACA GAA TTA GTC GTG GCT CAA AAC GAC AAA AAA 253
Lys Pro Lys Phe Ser Thr Glu Leu Val Val Ala Gln Asn Asp Lys Lys
50 55 60

GAC GCT AAA AAA CCT AAA TTT AGC ACA GAA TTA GTC GTG GCT CAA AAC 301
Asp Ala Lys Lys Pro Lys Phe Ser Thr Glu Leu Val Val Ala Gln Asn
65 70 75 80

GAC AAA AAA GAC GCT AAA AAA CCT AAA AAC TCA GTG GTC TAATGGCTTT GA 352
Asp Lys Lys Asp Ala Lys Lys Pro Lys Asn Ser Val Val 
85 90 

CTCTAAAAAA GCGTTTTTAA AAACGCTTTT TTGGATATTA TCCTATAATT TCCTACCA 410

(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Met Lys Lys Gln Ile Leu Thr Gly Val Leu Leu Ser Val Leu Ala Val
1 5 10 15

Ser Ser Ala Tyr Ala His Lys Asp Lys Lys Asp Ala Lys Lys Pro Lys
20 25 30

Phe Ser Thr Glu Leu Val Val Ala Gln Asn Asp Lys Lys Asp Ala Lys
35 40 45

Lys Pro Lys Phe Ser Thr Glu Leu Val Val Ala Gln Asn Asp Lys Lys
50 55 60

Asp Ala Lys Lys Pro Lys Phe Ser Thr Glu Leu Val Val Ala Gln Asn
65 70 75 80 

Asp Lys Lys Asp Ala Lys Lys Pro Lys Asn Ser Val Val 
85 90 

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2097 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 67...2046
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

TAAAAACCCC TATCATAGGG CGTGGCATGA AGAAAAAAGC AAAAGTCTTT TGGTATTGTT 60

TTAATC ATG ATT TAT TGG TTG TAT TTG GCG GTC TTT TTT TTG TTG AGC 108
Met Ile Tyr Trp Leu Tyr Leu Ala Val Phe Phe Leu Leu Ser
1 5 10

GCA TTA GAC GCT AAA GAA ATC GCT ATG CAA CGA TTT GAC AAA CAA AAC 156
Ala Leu Asp Ala Lys Glu Ile Ala Met Gln Arg Phe Asp Lys Gln Asn
15 20 25 30

CAT AAG ATT TTT GAA ATC CTT GCG GAT AAA GTG AGC GCT AAA GAC AAT 204
His Lys Ile Phe Glu Ile Leu Ala Asp Lys Val Ser Ala Lys Asp Asn
35 40 45

GTG ATA ACC GCA TCA GGG AAT GCG ATC TTA TTG AAT TAT GAT GTG TAT 252
Val Ile Thr Ala Ser Gly Asn Ala Ile Leu Leu Asn Tyr Asp Val Tyr
50 55 60

ATT CTA GCG GAC AAG GTG CGT TAT GAC ACT AAA ACC AAA GAA GCG TTA 300
Ile Leu Ala Asp Lys Val Arg Tyr Asp Thr Lys Thr Lys Glu Ala Leu
65 70 75

TTA GAG GGG AAT ATC AAG GTT TAT AGG GGC GAG GGT TTG CTC GTT AAA 348
Leu Glu Gly Asn Ile Lys Val Tyr Arg Gly Glu Gly Leu Leu Val Lys
80 85 90

ACC GAT TAC GTG AAA TTG AGT TTG AAT GAA AAA TAT GAA ATC ATT TTC 396
Thr Asp Tyr Val Lys Leu Ser Leu Asn Glu Lys Tyr Glu Ile Ile Phe
95 100 105 110

CCC TTT TAT GTC CAA GAC AGC GTG AGC GGG ATT TGG GTG AGC GCG GAT 444
Pro Phe Tyr Val Gln Asp Ser Val Ser Gly Ile Trp Val Ser Ala Asp
115 120 125

ATT GCC AGC GGA AAG GAT CAA AAA TAT AAG GTT AAA AAC ATG AGC ACT 492
Ile Ala Ser Gly Lys Asp Gln Lys Tyr Lys Val Lys Asn Met Ser Thr
130 135 140

TCA GGG TGC AGC ATT GAT AAC CCC ATT TGG CAT GTC AAT GCG ACT TCA 540
Ser Gly Cys Ser Ile Asp Asn Pro Ile Trp His Val Asn Ala Thr Ser
145 150 155

GGC TCA TTC AAC ATG CAA AAA TCG CAT TTG TCT ATG TGG AAT CCT AAG 588
Gly Ser Phe Asn Met Gln Lys Ser His Leu Ser Met Trp Asn Pro Lys
160 165 170

ATC TAT GTC GGT GAT ATT CCT GTA TTG TAT TTG CCC TAT ATT TTC ATG 636
Ile Tyr Val Gly Asp Ile Pro Val Leu Tyr Leu Pro Tyr Ile Phe Met
175 180 185 190

TCC ACG AGC AAT AAA AGA ACT ACT GGG TTT TTA TAC CCT GAG TTT GGC 684
Ser Thr Ser Asn Lys Arg Thr Thr Gly Phe Leu Tyr Pro Glu Phe Gly
195 200 205

ACT TCC AAC TTA GAC GGC TTT ATT TAT TTG CAA CCC TTT TAT TTA GCC 732
Thr Ser Asn Leu Asp Gly Phe Ile Tyr Leu Gln Pro Phe Tyr Leu Ala
210 215 220

CCC AAA AAC TCA TGG GAT ATG ACC TTT ACC CCA CAA ATC CGC TAT AAA 780
Pro Lys Asn Ser Trp Asp Met Thr Phe Thr Pro Gln Ile Arg Tyr Lys
225 230 235

AGG GGT TTT GGC TTG AAT TTT GAA GCG CGC TAC ATT AAC TCT AAA AAC 828
Arg Gly Phe Gly Leu Asn Phe Glu Ala Arg Tyr Ile Asn Ser Lys Asn
240 245 250

GAC AGG TTT TTA TTC AAC GCG CGC TAT TTT AGG AAT TAC ACC CAA TAT 876
Asp Arg Phe Leu Phe Asn Ala Arg Tyr Phe Arg Asn Tyr Thr Gln Tyr
255 260 265 270

GTC AAA CGC TAC GAT TTG AGG AAT CAA AAT ATC TAC GGG TTT GAA TTT 924
Val Lys Arg Tyr Asp Leu Arg Asn Gln Asn Ile Tyr Gly Phe Glu Phe
275 280 285

TTA AGC TCT AGC AGG GAC ACT TTA CAA AAA TAC TTC CAC CTT AAG TCT 972
Leu Ser Ser Ser Arg Asp Thr Leu Gln Lys Tyr Phe His Leu Lys Ser
290 295 300

AAT ATT GAC AAC GGG CAT TAC ATT GAC TTT TTA TAC ATG AAC GAT TTG 1020
Asn Ile Asp Asn Gly His Tyr Ile Asp Phe Leu Tyr Met Asn Asp Leu
305 310 315

GAC TAT GTG CGT TTT GAA AAG GTT AAT AAG CGT ATC ACA GAC GCC ACG 1068
Asp Tyr Val Arg Phe Glu Lys Val Asn Lys Arg Ile Thr Asp Ala Thr
320 325 330

CAC ATG TCT AGG GCG AAT TAC TAT TTG CAA ACA GAA AAC AAT TAT TAC 1116
His Met Ser Arg Ala Asn Tyr Tyr Leu Gln Thr Glu Asn Asn Tyr Tyr
335 340 345 350

GGC TTG AAT ATC AAG TAT TTT TTA AAC CTG AAT AAA ATC AAC AAT AAC 1164
Gly Leu Asn Ile Lys Tyr Phe Leu Asn Leu Asn Lys Ile Asn Asn Asn
355 360 365

CGC ACT TTC CAA TCT GTC CCT AAT TTG CAA TAC CAT AAA TAT TTA AAT 1212
Arg Thr Phe Gln Ser Val Pro Asn Leu Gln Tyr His Lys Tyr Leu Asn
370 375 380

TCT TTG TAT TTT AGA AAT TTG TTG TAT TCG GTG GAT TAT CAG TTT AGA 1260
Ser Leu Tyr Phe Arg Asn Leu Leu Tyr Ser Val Asp Tyr Gln Phe Arg
385 390 395

AAC ACC GCA AGA GAG ATT GGT TAT GGC TAT GTG CAA AAC GCT TTG AAT 1308
Asn Thr Ala Arg Glu Ile Gly Tyr Gly Tyr Val Gln Asn Ala Leu Asn
400 405 410

GTG CCG GTG GGC TTG CAA TTT TCT TTG TTT AAA AAG TAT TTG TCT TTA 1356
Val Pro Val Gly Leu Gln Phe Ser Leu Phe Lys Lys Tyr Leu Ser Leu
415 420 425 430

GGG CTT TGG AAT GAT CTC CAA CTA TCT AAT GTG GCT TTA ATG CAA TCT 1404
Gly Leu Trp Asn Asp Leu Gln Leu Ser Asn Val Ala Leu Met Gln Ser
435 440 445

AAA AAT TCC TTC GTG CCT ACG ATC CCT AAT GAA TCA AGG GAA TTT GGG 1452
Lys Asn Ser Phe Val Pro Thr Ile Pro Asn Glu Ser Arg Glu Phe Gly
450 455 460

AAT TTT GTG TCT TCA AAT TTT TCC ATG TAT GTC AAT ACG GAT TTG GCT 1500
Asn Phe Val Ser Ser Asn Phe Ser Met Tyr Val Asn Thr Asp Leu Ala
465 470 475

AGA GAA TAC AAC AAG CTT TTC CAC ACG ATC CAA CTA GAA GCG ATT TTC 1548






Arg Glu Tyr Asn Lys Leu Phe His Thr Ile Gln Leu Glu Ala Ile Phe
480 485 490

AAC ATC CCT TAT TAC ACC TTT AAA AAC GGC TTA TTT TCT CAA AAC ATG 1596
Asn Ile Pro Tyr Tyr Thr Phe Lys Asn Gly Leu Phe Ser Gln Asn Met
495 500 505 510

TAT GCT TTA AGC GCG CAA GCC TTA AAC AGC TAC ACT TCG CCT TTA TTG 1644
Tyr Ala Leu Ser Ala Gln Ala Leu Asn Ser Tyr Thr Ser Pro Leu Leu
515 520 525

AGA GAT TAT GAT TAT CAA GGG CGT TTG TAT GAC TCG GTG TGG AAT CCT 1692
Arg Asp Tyr Asp Tyr Gln Gly Arg Leu Tyr Asp Ser Val Trp Asn Pro
530 535 540

AGC AGT ATT TTA CCT AGC AAT GCG AGC AAC AAG ACG GTG GAT TTA ACC 1740
Ser Ser Ile Leu Pro Ser Asn Ala Ser Asn Lys Thr Val Asp Leu Thr
545 550 555

CTA ACG CAA TAC CTT TAT GGC TTA GGG GGG CAA GAG TTA TTG TAT TTT 1788
Leu Thr Gln Tyr Leu Tyr Gly Leu Gly Gly Gln Glu Leu Leu Tyr Phe
560 565 570

AAA ATA TCG CAA CTC ATC AAT CTT GAC GAT AAA GTT TCG CCC TTT AGA 1836
Lys Ile Ser Gln Leu Ile Asn Leu Asp Asp Lys Val Ser Pro Phe Arg
575 580 585 590

ATG CCA CTA GAG AGC AAG ATC GGG TTT TCG CCC TTA ACG GGA TTG AAC 1884
Met Pro Leu Glu Ser Lys Ile Gly Phe Ser Pro Leu Thr Gly Leu Asn
595 600 605

ATC TTT GGG AAT GTC TTT TAT TCG TTT TAT CAA AAC CGC TTA GAA GAA 1932
Ile Phe Gly Asn Val Phe Tyr Ser Phe Tyr Gln Asn Arg Leu Glu Glu
610 615 620

ATC TCT GTG AAC GCC AAT TAC CAA CGC AAG TTT TTA AGC TTT AAC CTC 1980
Ile Ser Val Asn Ala Asn Tyr Gln Arg Lys Phe Leu Ser Phe Asn Leu
625 630 635

TCT TAT TTT TTA AAA AAC AAT TTT AGC AGT GGG ATT AAT AGC ATT GTA 2028
Ser Tyr Phe Leu Lys Asn Asn Phe Ser Ser Gly Ile Asn Ser Ile Val
640 645 650

GAA AAT CTG CGG ATT ATT TAAAGGCGGG TTTTAGCAAC GACTTTGGCT ATTTTTCC 2084
Glu Asn Leu Arg Ile Ile

655 660 

ATGAGCGCGG ATG 2097 



(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 660 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

Met Ile Tyr Trp Leu Tyr Leu Ala Val Phe Phe Leu Leu Ser Ala Leu
1 5 10 15

Asp Ala Lys Glu Ile Ala Met Gln Arg Phe Asp Lys Gln Asn His Lys
20 25 30

Ile Phe Glu Ile Leu Ala Asp Lys Val Ser Ala Lys Asp Asn Val Ile
35 40 45

Thr Ala Ser Gly Asn Ala Ile Leu Leu Asn Tyr Asp Val Tyr Ile Leu
50 55 60

Ala Asp Lys Val Arg Tyr Asp Thr Lys Thr Lys Glu Ala Leu Leu Glu
65 70 75 80

Gly Asn Ile Lys Val Tyr Arg Gly Glu Gly Leu Leu Val Lys Thr Asp
85 90 95

Tyr Val Lys Leu Ser Leu Asn Glu Lys Tyr Glu Ile Ile Phe Pro Phe
100 105 110

Tyr Val Gln Asp Ser Val Ser Gly Ile Trp Val Ser Ala Asp Ile Ala
115 120 125

Ser Gly Lys Asp Gln Lys Tyr Lys Val Lys Asn Met Ser Thr Ser Gly
130 135 140

Cys Ser Ile Asp Asn Pro Ile Trp His Val Asn Ala Thr Ser Gly Ser
145 150 155 160

Phe Asn Met Gln Lys Ser His Leu Ser Met Trp Asn Pro Lys Ile Tyr
165 170 175

Val Gly Asp Ile Pro Val Leu Tyr Leu Pro Tyr Ile Phe Met Ser Thr
180 185 190

Ser Asn Lys Arg Thr Thr Gly Phe Leu Tyr Pro Glu Phe Gly Thr Ser
195 200 205

Asn Leu Asp Gly Phe Ile Tyr Leu Gln Pro Phe Tyr Leu Ala Pro Lys
210 215 220

Asn Ser Trp Asp Met Thr Phe Thr Pro Gln Ile Arg Tyr Lys Arg Gly
225 230 235 240

Phe Gly Leu Asn Phe Glu Ala Arg Tyr Ile Asn Ser Lys Asn Asp Arg
245 250 255

Phe Leu Phe Asn Ala Arg Tyr Phe Arg Asn Tyr Thr Gln Tyr Val Lys
260 265 270

Arg Tyr Asp Leu Arg Asn Gln Asn Ile Tyr Gly Phe Glu Phe Leu Ser
275 280 285

Ser Ser Arg Asp Thr Leu Gln Lys Tyr Phe His Leu Lys Ser Asn Ile
290 295 300

Asp Asn Gly His Tyr Ile Asp Phe Leu Tyr Met Asn Asp Leu Asp Tyr
305 310 315 320

Val Arg Phe Glu Lys Val Asn Lys Arg Ile Thr Asp Ala Thr His Met
325 330 335

Ser Arg Ala Asn Tyr Tyr Leu Gln Thr Glu Asn Asn Tyr Tyr Gly Leu
340 345 350

Asn Ile Lys Tyr Phe Leu Asn Leu Asn Lys Ile Asn Asn Asn Arg Thr
355 360 365

Phe Gln Ser Val Pro Asn Leu Gln Tyr His Lys Tyr Leu Asn Ser Leu
370 375 380

Tyr Phe Arg Asn Leu Leu Tyr Ser Val Asp Tyr Gln Phe Arg Asn Thr
385 390 395 400

Ala Arg Glu Ile Gly Tyr Gly Tyr Val Gln Asn Ala Leu Asn Val Pro
405 410 415

Val Gly Leu Gln Phe Ser Leu Phe Lys Lys Tyr Leu Ser Leu Gly Leu
420 425 430

Trp Asn Asp Leu Gln Leu Ser Asn Val Ala Leu Met Gln Ser Lys Asn
435 440 445

Ser Phe Val Pro Thr Ile Pro Asn Glu Ser Arg Glu Phe Gly Asn Phe
450 455 460

Val Ser Ser Asn Phe Ser Met Tyr Val Asn Thr Asp Leu Ala Arg Glu
465 470 475 480

Tyr Asn Lys Leu Phe His Thr Ile Gln Leu Glu Ala Ile Phe Asn Ile
485 490 495

Pro Tyr Tyr Thr Phe Lys Asn Gly Leu Phe Ser Gln Asn Met Tyr Ala
500 505 510

Leu Ser Ala Gln Ala Leu Asn Ser Tyr Thr Ser Pro Leu Leu Arg Asp
515 520 525

Tyr Asp Tyr Gln Gly Arg Leu Tyr Asp Ser Val Trp Asn Pro Ser Ser
530 535 540

Ile Leu Pro Ser Asn Ala Ser Asn Lys Thr Val Asp Leu Thr Leu Thr
545 550 555 560

Gln Tyr Leu Tyr Gly Leu Gly Gly Gln Glu Leu Leu Tyr Phe Lys Ile
565 570 575

Ser Gln Leu Ile Asn Leu Asp Asp Lys Val Ser Pro Phe Arg Met Pro
580 585 590

Leu Glu Ser Lys Ile Gly Phe Ser Pro Leu Thr Gly Leu Asn Ile Phe
595 600 605

Gly Asn Val Phe Tyr Ser Phe Tyr Gln Asn Arg Leu Glu Glu Ile Ser
610 615 620

Val Asn Ala Asn Tyr Gln Arg Lys Phe Leu Ser Phe Asn Leu Ser Tyr
625 630 635 640

Phe Leu Lys Asn Asn Phe Ser Ser Gly Ile Asn Ser Ile Val Glu Asn
645 650 655

Leu Arg Ile Ile
660 



(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 168...764
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

ATGCCGATTA AATGCATGCT GATTAAATGA ATGAAAAGAG TCCAAACCAC CGCCTTTAAC 60

GCACCACGCT TGAAATTAAA ACTAAATTTT AGTGTATTCT TAGCAAATTT TAGATAAGAT 120

CAAGCGTGAT TTTTTCTAAA TTTTAGGCAT TTAAGGAATC AGTGTTT ATG ACA AGC 176
Met Thr Ser
1

GCT CTG TTA GGC TTA CAA ATT GTT TTA GCG GTA TTG ATT GTG GTG GTG 224
Ala Leu Leu Gly Leu Gln Ile Val Leu Ala Val Leu Ile Val Val Val
5 10 15

GTT TTG TTG CAA AAA AGT TCT AGC ATC GGC TTA GGG GCT TAT AGC GGG 272
Val Leu Leu Gln Lys Ser Ser Ser Ile Gly Leu Gly Ala Tyr Ser Gly
20 25 30 35

AGT AAT GAG TCT TTA TTT GGC GCT AAA GGG CCT GCA AGC TTT ATG GCG 320
Ser Asn Glu Ser Leu Phe Gly Ala Lys Gly Pro Ala Ser Phe Met Ala
40 45 50

AAA TTA ACC ATG TTT TTA GGG CTG TTA TTT GTC ATC AAC ACC ATC GCT 368
Lys Leu Thr Met Phe Leu Gly Leu Leu Phe Val Ile Asn Thr Ile Ala
55 60 65

TTG GGC TAT TTT TAC AAC AAA GAA TAC GGC AAG AGC GTT TTA GAT GAG 416
Leu Gly Tyr Phe Tyr Asn Lys Glu Tyr Gly Lys Ser Val Leu Asp Glu
70 75 80

ACT AAA ACC AAC AAA GAA CTT TCG CCC CTA GTC CCT GCC ACC GGC ACG 464
Thr Lys Thr Asn Lys Glu Leu Ser Pro Leu Val Pro Ala Thr Gly Thr
85 90 95

CTT AAC CCT GCA CTT AAT CCC ACA TTA AAC CCA ACG CTC AAC CCT TTA 512
Leu Asn Pro Ala Leu Asn Pro Thr Leu Asn Pro Thr Leu Asn Pro Leu
100 105 110 115

GAG CAA GCC CCA ACT AAT CCT TTA ATG CCA CAA CAA ACG CCT AAC GAA 560
Glu Gln Ala Pro Thr Asn Pro Leu Met Pro Gln Gln Thr Pro Asn Glu
120 125 130

CTC CCT AAA GAG CCA GCC AAA ACG CCT TCT GTT GAA AGC CCC AAA CAG 608
Leu Pro Lys Glu Pro Ala Lys Thr Pro Ser Val Glu Ser Pro Lys Gln
135 140 145

AAT GAA AAG AAT GAA AAG AAT GAC GCC AAA GAG AAT GGT ATA AAG GGT 656
Asn Glu Lys Asn Glu Lys Asn Asp Ala Lys Glu Asn Gly Ile Lys Gly
150 155 160

GTT GAA AAA ACC AAA GAG AAC GCC AAA ACG CCC CCA ACC ACC CAC CAA 704
Val Glu Lys Thr Lys Glu Asn Ala Lys Thr Pro Pro Thr Thr His Gln
165 170 175

AAG CCT AAA ACG CAT GCA ACG CAA ACC AAC GCC CAT ACC AAC CAA AAA 752
Lys Pro Lys Thr His Ala Thr Gln Thr Asn Ala His Thr Asn Gln Lys
180 185 190 195

AAG GAT GAA AAA TAATGTTACA GGCCATTTAT AACGAAACCA AAGATCTGAT GCAAA 809
Lys Asp Glu Lys

AAAGCATTCA AGCTTTAAAC AGGGATTTTT CCACTCTAAG GAGCGCGAAA GTTTCAGTCA 869

ATATTTTAGA TCACATCAAA GTGGATTATT ACGGCACGCC CACGGCATTA AATCAAGTCG 929

GATCCGTGAT GAGCTTGGAT GCGACCACCC TT 961

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 199 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

Met Thr Ser Ala Leu Leu Gly Leu Gln Ile Val Leu Ala Val Leu Ile
1 5 10 15

Val Val Val Val Leu Leu Gln Lys Ser Ser Ser Ile Gly Leu Gly Ala
20 25 30

Tyr Ser Gly Ser Asn Glu Ser Leu Phe Gly Ala Lys Gly Pro Ala Ser
35 40 45

Phe Met Ala Lys Leu Thr Met Phe Leu Gly Leu Leu Phe Val Ile Asn
50 55 60

Thr Ile Ala Leu Gly Tyr Phe Tyr Asn Lys Glu Tyr Gly Lys Ser Val
65 70 75 80

Leu Asp Glu Thr Lys Thr Asn Lys Glu Leu Ser Pro Leu Val Pro Ala
85 90 95

Thr Gly Thr Leu Asn Pro Ala Leu Asn Pro Thr Leu Asn Pro Thr Leu
100 105 110

Asn Pro Leu Glu Gln Ala Pro Thr Asn Pro Leu Met Pro Gln Gln Thr
115 120 125

Pro Asn Glu Leu Pro Lys Glu Pro Ala Lys Thr Pro Ser Val Glu Ser
130 135 140

Pro Lys Gln Asn Glu Lys Asn Glu Lys Asn Asp Ala Lys Glu Asn Gly
145 150 155 160

Ile Lys Gly Val Glu Lys Thr Lys Glu Asn Ala Lys Thr Pro Pro Thr
165 170 175

Thr His Gln Lys Pro Lys Thr His Ala Thr Gln Thr Asn Ala His Thr
180 185 190

Asn Gln Lys Lys Asp Glu Lys
195 

(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 325...879
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

CCTAGTCCCT GCCACCGGCA CGCTTAACCC TGCACTTAAT CCCACATTAA ACCCAACGCT 60

CAACCCTTTA GAGCAAGCCC CAACTAATCC TTTAATGCCA CAACAAACGC CTAACGAACT 120

CCCTAAAGAG CCAGCCAAAA CGCCTTCTGT TGAAAGCCCC AAACAGAATG AAAAGAATGA 180

AAAGAATGAC GCCAAAGAGA ATGGTATAAA GGGTGTTGAA AAAACCAAAG AGAACGCCAA 240

AACGCCCCCA ACCACCCACC AAAAGCCTAA AACGCATGCA ACGCAAACCA ACGCCCATAC 300

CAACCAAAAA AAGGATGAAA AATA ATG TTA CAG GCC ATT TAT AAC GAA ACC 351
Met Leu Gln Ala Ile Tyr Asn Glu Thr
1 5

AAA GAT CTG ATG CAA AAA AGC ATT CAA GCT TTA AAC AGG GAT TTT TCC 399
Lys Asp Leu Met Gln Lys Ser Ile Gln Ala Leu Asn Arg Asp Phe Ser
10 15 20 25

ACT CTA AGG AGC GCG AAA GTT TCA GTC AAT ATT TTA GAT CAC ATC AAA 447
Thr Leu Arg Ser Ala Lys Val Ser Val Asn Ile Leu Asp His Ile Lys
30 35 40

GTG GAT TAT TAC GGC ACG CCC ACG GCA TTA AAT CAA GTC GGA TCC GTG 495
Val Asp Tyr Tyr Gly Thr Pro Thr Ala Leu Asn Gln Val Gly Ser Val
45 50 55

ATG AGC TTG GAT GCG ACC ACC CTT CAA ATC AGC CCA TGG GAA AAA AAC 543
Met Ser Leu Asp Ala Thr Thr Leu Gln Ile Ser Pro Trp Glu Lys Asn
60 65 70

CTG CTC AAA GAA ATT GAA AGA TCC ATT CAA GAA GCC AAT ATT GGT GTC 591
Leu Leu Lys Glu Ile Glu Arg Ser Ile Gln Glu Ala Asn Ile Gly Val
75 80 85

AAT CCT AAT AAC GAC GGC GAA ACG ATC AAG CTT TTT TTC CCG CCC ATG 639
Asn Pro Asn Asn Asp Gly Glu Thr Ile Lys Leu Phe Phe Pro Pro Met
90 95 100 105

ACA AGT GAG CAA AGA AAA CTC ATC GCA AAA GAC GCC AAA GCG ATG GGT 687
Thr Ser Glu Gln Arg Lys Leu Ile Ala Lys Asp Ala Lys Ala Met Gly
110 115 120

GAA AAG GCT AAA GTG GCT GTG AGG AAT ATC CGC CAA GAT GCT AAC AAC 735
Glu Lys Ala Lys Val Ala Val Arg Asn Ile Arg Gln Asp Ala Asn Asn
125 130 135

CAG GTG AAA AAA TTA GAA AAA GAC AAA GAA ATC AGC GAA GAT GAA AGC 783
Gln Val Lys Lys Leu Glu Lys Asp Lys Glu Ile Ser Glu Asp Glu Ser
140 145 150

AAA AAA GCC CAA GAG CAG ATC CAA AAA ATC ACC GAT GAA GCC ATT AAA 831
Lys Lys Ala Gln Glu Gln Ile Gln Lys Ile Thr Asp Glu Ala Ile Lys
155 160 165

AAA ATT GAT GAA AGC GTG AAA AAC AAA GAA GAC GCG ATC TTA AAG GTC T 880
Lys Ile Asp Glu Ser Val Lys Asn Lys Glu Asp Ala Ile Leu Lys Val
170 175 180 185

AAACCATGGA TATTAAGGCA TGTTATCAAA ACGCTAAAGC GTTATTAGAG GGGCATTTCT 940

TGCTCAGCAG TGGGTTTCAT TCCAATTATT ATTTGCAATC CGCTAAAGTT TTAGAAGATC 1000

CCAAACTAGC CGAACAATTA GCGCTAGAAT TAGCCAAACA AATCCAAGAA GCTCATTT 1058

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 185 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:



Met 


Leu 


Gin 


Ala 


He 


Tyr 


Asn 


Glu 


Thr 


Lys 


Asp 


Leu 


Met 


Gin 


Lys 


Ser 


1 








5 










10 










15 




He 


Gin 


Ala 


Leu 


Asn 


Arg 


Asp 


Phe 


Ser 


Thr 


Leu 


Arg 


Ser 


Ala 


Lys 


Val 








20 










25 










30 






Ser 


Val 


Asn 


He 


Leu 


Asp 


His 


He 


Lys 


Val 


Asp 


Tyr 


Tyr 


Gly 


Thr 


Pro 






35 










40 










45 








Thr 


Ala 


Leu 


Asn 


Gin 


Val 


Gly 


Ser 


Val 


Met 


Ser 


Leu 


Asp 


Ala 


Thr 


Thr 




50 










55 










60 










Leu 


Gin 


He 


Ser 


Pro 


Trp 


Glu 


Lys 


Asn 


Leu 


Leu 


Lys 


Glu 


He 


Glu 


Arg 


65 










70 










75 










80 


Ser 


He 


Gin 


Glu 


Ala 


Asn 


He 


Gly 


Val 


Asn 


Pro 


Asn 


Asn 


Asp 


Gly 


Glu 










85 










90 










95 




Thr 


He 


Lys 


Leu 


Phe 


Phe 


Pro 


Pro 


Met 


Thr 


Ser 


Glu 


Gin 


Arg 


Lys 


Leu 






100 










105 










110 






He 


Ala 


Lys 


Asp 


Ala 


Lys 


Ala 


Met 


Gly 


Glu 


Lys 


Ala 


Lys 


Val 


Ala 


Val 






115 










120 










125 








Arg 


Asn 


He 


Arg 


Gin 


Asp 


Ala 


Asn 


Asn 


Gin 


Val 


Lys 


Lys 


Leu 


Glu 


Lys 




130 










135 










140 










Asp 


Lys 


Glu 


He 


Ser 


Glu 


Asp 


Glu 


Ser 


Lys 


Lys 


Ala 


Gin 


Glu 


Gin 


He 


145 










150 










155 










160 


Gin 


Lys 


He 


Thr 


Asp 


Glu 


Ala 


He 


Lys 


Lys 


He 


Asp 


Glu 


Ser 


Val 


Lys 








165 










170 










175 




Asn 


Lys 


Glu 


Asp 


Ala 


He 


Leu 


Lys 


Val 

















180 185 
(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1669 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 163...1389
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

GAGTGGATGA AAAAGACACT TTCAATTTTG CAAAAATTGG CTATGAACAG GGCAAGGGCG 60

AAGAATTAAA AGAAGTAGAA GAAAAGCATG CGTTTAAGAA AATCCCTTTT GTCAAAGATT 120

TGCACAAAAT CGCCCCCACT ATCTTAAAAA AGAGGCTATA AA ATG GCT CAA AAT 174
Met Ala Gln Asn
1

TTC ACG AAA CTC AAC CCC CAG TTT GAA AAC ATC ATT TTT GAA CAT GAC 222
Phe Thr Lys Leu Asn Pro Gln Phe Glu Asn Ile Ile Phe Glu His Asp
5 10 15 20

GAC AAC CAA ATG ATT TTA AAC TTT GGC CCC CAA CAC CCC AGT AGT CAT 270
Asp Asn Gln Met Ile Leu Asn Phe Gly Pro Gln His Pro Ser Ser His
25 30 35

GGG CAA TTG CGC TTG ATT TTG GAA TTA GAG GGC GAA AAA ATC ATT AAG 318
Gly Gln Leu Arg Leu Ile Leu Glu Leu Glu Gly Glu Lys Ile Ile Lys
40 45 50

GCT ACC CCT GAA ATT GGC TAC TTG CAT AGA GGC TGT GAA AAG TTA GGC 366
Ala Thr Pro Glu Ile Gly Tyr Leu His Arg Gly Cys Glu Lys Leu Gly
55 60 65

GAA AAC ATG ACC TAT AAC GAA TAC ATG CCC ACT ACT GAT AGA TTG GAT 414
Glu Asn Met Thr Tyr Asn Glu Tyr Met Pro Thr Thr Asp Arg Leu Asp
70 75 80

TAC ACT TCT TCT ACC AGC AAT AAT TAC GCT TAC GCT TAT GCG GTA GAG 462
Tyr Thr Ser Ser Thr Ser Asn Asn Tyr Ala Tyr Ala Tyr Ala Val Glu
85 90 95 100

ACC TTA CTC AAT TTA GAA ATC CCA CGC CGA GCG CAG GTG ATC CGC ACG 510
Thr Leu Leu Asn Leu Glu Ile Pro Arg Arg Ala Gln Val Ile Arg Thr
105 110 115

ATT TTA CTA GAG CTT AAC CGC ATG ATC TCA CAC ATC TTT TTT ATC AGC 558
Ile Leu Leu Glu Leu Asn Arg Met Ile Ser His Ile Phe Phe Ile Ser
120 125 130

GTG CAT GCT TTA GAT GTG GGG GCG ATG AGC GTG TTT TTG TAT GCG TTT 606
Val His Ala Leu Asp Val Gly Ala Met Ser Val Phe Leu Tyr Ala Phe
135 140 145

AAA ACG AGG GAA TAC GGC TTG GAT TTG ATG GAG GAT TAT TGC GGG GCT 654
Lys Thr Arg Glu Tyr Gly Leu Asp Leu Met Glu Asp Tyr Cys Gly Ala
150 155 160

AGG CTC ACG CAT AAC GCT ATA AGG ATT GGG GGC GTG CCT TTA GAT TTA 702
Arg Leu Thr His Asn Ala Ile Arg Ile Gly Gly Val Pro Leu Asp Leu
165 170 175 180

CCC CCT AAT TGG TTA GAA GGC TTA AAA AAG TTT TTA GGC GAA ATG AGG 750
Pro Pro Asn Trp Leu Glu Gly Leu Lys Lys Phe Leu Gly Glu Met Arg
185 190 195

GAA TGC AAA AAA CTC ATT CAA GGC TTA TTG GAT AAG AAT CGC ATT TGG 798
Glu Cys Lys Lys Leu Ile Gln Gly Leu Leu Asp Lys Asn Arg Ile Trp
200 205 210

CGG ATG CGC TTG GAA AAT GTG GGC GTT GTA ACG CAA AAA ATG GCG CAA 846
Arg Met Arg Leu Glu Asn Val Gly Val Val Thr Gln Lys Met Ala Gln
215 220 225

AGC TGG GGC ATG AGC GGT ATC ATG TTA AGA GGG ACT GGG ATC GCT TAT 894
Ser Trp Gly Met Ser Gly Ile Met Leu Arg Gly Thr Gly Ile Ala Tyr
230 235 240

GAC ATC AGA AAA GAA GAG CCT TAT GAG CTT TAT AAA GAG CTT GAT TTT 942
Asp Ile Arg Lys Glu Glu Pro Tyr Glu Leu Tyr Lys Glu Leu Asp Phe
245 250 255 260

GAT GTG CCG GTG GGC AAT TAT GGC GAT AGT TAT GAT AGG TAT TGT TTG 990
Asp Val Pro Val Gly Asn Tyr Gly Asp Ser Tyr Asp Arg Tyr Cys Leu
265 270 275

TAT ATG TTA GAA ATT GAT GAA AGC GTT CGC ATC ATT GAA CAG CTC ATT 1038
Tyr Met Leu Glu Ile Asp Glu Ser Val Arg Ile Ile Glu Gln Leu Ile
280 285 290

CCT ATG TAT GCT AAA ACC GAT ACG CCT ATC ATG GCT CAA AAC CCG CAT 1086
Pro Met Tyr Ala Lys Thr Asp Thr Pro Ile Met Ala Gln Asn Pro His
295 300 305

TAT ATT TCC GCC CCT AAA GAA GAT ATA ATG ACG CAA AAC TAC GCC TTG 1134
Tyr Ile Ser Ala Pro Lys Glu Asp Ile Met Thr Gln Asn Tyr Ala Leu
310 315 320

ATG CAG CAT TTT GTT TTA GTG GCT CAG GGC ATG CGT CCG CCC GTT GGG 1182
Met Gln His Phe Val Leu Val Ala Gln Gly Met Arg Pro Pro Val Gly
325 330 335 340

GAA GTG TAT GCC CCC ACA GAA AGC CCT AAA GGG GAA TTA GGG TTT TTT 1230
Glu Val Tyr Ala Pro Thr Glu Ser Pro Lys Gly Glu Leu Gly Phe Phe
345 350 355

ATC CAT TCA GAG GGC GAG CCT TAC CCT CAC AGG CTA AAA ATC AGA GCC 1278
Ile His Ser Glu Gly Glu Pro Tyr Pro His Arg Leu Lys Ile Arg Ala
360 365 370

CCT AGC TTT TAT CAC ATT GGG GCT TTG AGC GAC ATT TTA GTG GGG CAA 1326
Pro Ser Phe Tyr His Ile Gly Ala Leu Ser Asp Ile Leu Val Gly Gln
375 380 385

TAT TTA GCG GAT GCA GTA ACC GTG ATT GGC TCA ACC AAT GCG GTG TTT 1374
Tyr Leu Ala Asp Ala Val Thr Val Ile Gly Ser Thr Asn Ala Val Phe
390 395 400

GGC GAG GTG GAT AGA TGAAACGCTT TGATTTACGC CCCTTAAAAG CGGGTATTTT T 1430
Gly Glu Val Asp Arg
405

GAACGCTTAG AAGAATTGAT TGAAAAAGAA ATGCAACCTA ATGAAGTCGC TATTTTCATG 1490

TTTGAAGTGG GGGATTTTTC TAATATCCCT AAGAGCGCTG AATTTATCCA ATCTAAAGGG 1550

CATGAGCTCC TCAATTCTTT GCGTTTCAAT CAAGCGGATT GGACGATTGT CGTGAGAAAA 1610

AAGGCTTGAT TTTGAGCGGC TTTAACCCCT TAAATTCTCC CTTAGTCGCA AGCTCTTCT 1669

(2) INFORMATION FOR SEQ ID NO:44: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 409 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Met Ala Gln Asn Phe Thr Lys Leu Asn Pro Gln Phe Glu Asn Ile Ile
1 5 10 15

Phe Glu His Asp Asp Asn Gln Met Ile Leu Asn Phe Gly Pro Gln His
20 25 30

Pro Ser Ser His Gly Gln Leu Arg Leu Ile Leu Glu Leu Glu Gly Glu
35 40 45

Lys Ile Ile Lys Ala Thr Pro Glu Ile Gly Tyr Leu His Arg Gly Cys
50 55 60

Glu Lys Leu Gly Glu Asn Met Thr Tyr Asn Glu Tyr Met Pro Thr Thr
65 70 75 80

Asp Arg Leu Asp Tyr Thr Ser Ser Thr Ser Asn Asn Tyr Ala Tyr Ala
85 90 95

Tyr Ala Val Glu Thr Leu Leu Asn Leu Glu Ile Pro Arg Arg Ala Gln
100 105 110

Val Ile Arg Thr Ile Leu Leu Glu Leu Asn Arg Met Ile Ser His Ile
115 120 125

Phe Phe Ile Ser Val His Ala Leu Asp Val Gly Ala Met Ser Val Phe
130 135 140

Leu Tyr Ala Phe Lys Thr Arg Glu Tyr Gly Leu Asp Leu Met Glu Asp
145 150 155 160

Tyr Cys Gly Ala Arg Leu Thr His Asn Ala Ile Arg Ile Gly Gly Val
165 170 175

Pro Leu Asp Leu Pro Pro Asn Trp Leu Glu Gly Leu Lys Lys Phe Leu
180 185 190

Gly Glu Met Arg Glu Cys Lys Lys Leu Ile Gln Gly Leu Leu Asp Lys
195 200 205

Asn Arg Ile Trp Arg Met Arg Leu Glu Asn Val Gly Val Val Thr Gln
210 215 220

Lys Met Ala Gln Ser Trp Gly Met Ser Gly Ile Met Leu Arg Gly Thr
225 230 235 240

Gly Ile Ala Tyr Asp Ile Arg Lys Glu Glu Pro Tyr Glu Leu Tyr Lys
245 250 255

Glu Leu Asp Phe Asp Val Pro Val Gly Asn Tyr Gly Asp Ser Tyr Asp
260 265 270

Arg Tyr Cys Leu Tyr Met Leu Glu Ile Asp Glu Ser Val Arg Ile Ile
275 280 285

Glu Gln Leu Ile Pro Met Tyr Ala Lys Thr Asp Thr Pro Ile Met Ala
290 295 300

Gln Asn Pro His Tyr Ile Ser Ala Pro Lys Glu Asp Ile Met Thr Gln
305 310 315 320

Asn Tyr Ala Leu Met Gln His Phe Val Leu Val Ala Gln Gly Met Arg
325 330 335

Pro Pro Val Gly Glu Val Tyr Ala Pro Thr Glu Ser Pro Lys Gly Glu
340 345 350

Leu Gly Phe Phe Ile His Ser Glu Gly Glu Pro Tyr Pro His Arg Leu
355 360 365

Lys Ile Arg Ala Pro Ser Phe Tyr His Ile Gly Ala Leu Ser Asp Ile
370 375 380

Leu Val Gly Gln Tyr Leu Ala Asp Ala Val Thr Val Ile Gly Ser Thr
385 390 395 400

Asn Ala Val Phe Gly Glu Val Asp Arg
405

(2) INFORMATION FOR SEQ ID NO: 45:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 869 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 358...732
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

TAACTTGTGG TTAACTACCG CCAGACTCCT TTTGAGTTTG GCAAACGCGC CAATGAGTTC 60

TTTAGGCATT TTTTCAGTGC CGATCTTAAA GTTTTCAAGA CTGCGTTGCG TTTGAGCCCC 120

CCAATATTGG CTATCATTTA CTTTGATTTC GCCCATCGTG TCATGTTCAA TTCTAAATTG 180

CATGCTAATC CTTTGAAATT TGATTTTAAA ACCTTAAAAA AATAGCATAA ACTCTTATAC 240

CTTCTACTTA AAAACCCTAA TTTTTTAAAC ACCATTTCCA CAATTTTTAC ACAAAAGAGG 300

GTTATTATCC GTTCGCAACA AGAATTTTCT TGTTATCTTA ATGTAAAGGT CAAAACG ATG 360
Met
1

AAA AAG TTA GCC GCT TTA TTT TTA GTA AGC GTG TTG GGG GTT ATG GGT 408
Lys Lys Leu Ala Ala Leu Phe Leu Val Ser Val Leu Gly Val Met Gly
5 10 15

TTA AAC GCA TGG GAG CAA ACC CTA AAA GCT AAT GAC TTG GAA GTG AAA 456
Leu Asn Ala Trp Glu Gln Thr Leu Lys Ala Asn Asp Leu Glu Val Lys
20 25 30

ATC AAA TCC GTG GGT AAC CCC ATT AAA GGC GAT AAC ACT TTC ATT CTC 504
Ile Lys Ser Val Gly Asn Pro Ile Lys Gly Asp Asn Thr Phe Ile Leu
35 40 45

AGC CCC ACT TTA AAA GGT AAG GCT TTA GAA AAA GCT ATC GTT AGG GTG 552
Ser Pro Thr Leu Lys Gly Lys Ala Leu Glu Lys Ala Ile Val Arg Val
50 55 60 65

CAG TTT ATG ATG CCT GAA ATG CCC GGC ATG CCA GCG ATG AAA GAA ATG 600
Gln Phe Met Met Pro Glu Met Pro Gly Met Pro Ala Met Lys Glu Met
70 75 80

GCG CAA GTG AGT GAA AAA AAC GGC CTT TAT GAA GCT AAA ACC AAT CTT 648
Ala Gln Val Ser Glu Lys Asn Gly Leu Tyr Glu Ala Lys Thr Asn Leu
85 90 95

TCT ATG AAC GGG ACA TGG CAG GTT AGG GTG GAT ATT AAA TCT AAA GAG 696
Ser Met Asn Gly Thr Trp Gln Val Arg Val Asp Ile Lys Ser Lys Glu
100 105 110

GGT CAG GTT TAT CGC GCT AAA ACA AGC CTG GAT TTA TAAGAGCATG CTATCT 748
Gly Gln Val Tyr Arg Ala Lys Thr Ser Leu Asp Leu
115 120 125

TTTATAAGCG CGTTTGATAA AAGGGGCGTT TCAATACGCC TTTTAACAGC CTTGTTACTG 808

CTTTTTAGTT TGGGTTTGGC TAAAGATTTA GAGATCCAAT CTTTTGTGGC TAAATACCTT 868

T 869

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Met Lys Lys Leu Ala Ala Leu Phe Leu Val Ser Val Leu Gly Val Met

1 5 10 15

Gly Leu Asn Ala Trp Glu Gln Thr Leu Lys Ala Asn Asp Leu Glu Val

20 25 30

Lys Ile Lys Ser Val Gly Asn Pro Ile Lys Gly Asp Asn Thr Phe Ile

35 40 45

Leu Ser Pro Thr Leu Lys Gly Lys Ala Leu Glu Lys Ala Ile Val Arg

50 55 60

Val Gln Phe Met Met Pro Glu Met Pro Gly Met Pro Ala Met Lys Glu

65 70 75 80

Met Ala Gln Val Ser Glu Lys Asn Gly Leu Tyr Glu Ala Lys Thr Asn

85 90 95

Leu Ser Met Asn Gly Thr Trp Gln Val Arg Val Asp Ile Lys Ser Lys

100 105 110

Glu Gly Gln Val Tyr Arg Ala Lys Thr Ser Leu Asp Leu

115 120 125 
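
The pairing of a coding sequence with its protein listing (here SEQ ID NO: 45 with SEQ ID NO: 46) can be spot-checked by translating the opening codons with the standard genetic code. The Python sketch below is illustrative only and is not part of the listing as filed. The names `CODON_TABLE`, `THREE_LETTER` and `translate` are assumptions introduced for this example, and the input is nucleotides 358 to 408 as printed in SEQ ID NO: 45.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons enumerated in TCAG order; '*' marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}

def translate(cds):
    """Translate space-separated codons, stopping at the first stop codon."""
    residues = []
    for codon in cds.split():
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return residues

# Start codon plus the first full codon row of SEQ ID NO: 45 (positions 358-408).
cds_start = ("ATG AAA AAG TTA GCC GCT TTA TTT TTA GTA AGC GTG "
             "TTG GGG GTT ATG GGT")
print(" ".join(translate(cds_start)))
```

The printed residues can be compared by eye against the first seventeen residues of SEQ ID NO: 46. The same function could be applied to any other codon row in the listing, assuming the row has been copied without OCR damage.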

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1217 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 73...1152
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

TCCATGCGTT TTGATGCGAT TTTAAAAAAT CTTTGGGTAT TTTAGCATGC CAATGGTTAA 60

AAAAAGGTGG TT ATG AAT GGT TTT TGC GCT AGA CTA CGA GCC ATA ACT CAT 111
Met Asn Gly Phe Cys Ala Arg Leu Arg Ala Ile Thr His
1 5 10

AAT GAA AGA TTA AAA ATG AAA ATA GCG GTA TTA CTC AGT GGG GGG GTG 159
Asn Glu Arg Leu Lys Met Lys Ile Ala Val Leu Leu Ser Gly Gly Val
15 20 25

GAT AGC TCT TAT AGC GCT TAT AGC TTA AAA GAG CAA GGG CAT GAA TTA 207
Asp Ser Ser Tyr Ser Ala Tyr Ser Leu Lys Glu Gln Gly His Glu Leu
30 35 40 45

GTG GGG ATT TAT TTA AAA CTC CAT GCG AGT GAA AAA AAG CAT GAT TTA 255
Val Gly Ile Tyr Leu Lys Leu His Ala Ser Glu Lys Lys His Asp Leu
50 55 60

TAC ATC AAA AAC GCT CAA AAA GCA TGC GAG TTT TTA GGC ATT CCT TTA 303
Tyr Ile Lys Asn Ala Gln Lys Ala Cys Glu Phe Leu Gly Ile Pro Leu
65 70 75

GAG GTG TTG GAT TTT CAA AAG GAT TTT AAA AGC GCG GTT TAT GAT GAA 351
Glu Val Leu Asp Phe Gln Lys Asp Phe Lys Ser Ala Val Tyr Asp Glu
80 85 90

TTT ATC AAC GCC TAT GAA GAA GGG CAA ACC CCA AAC CCT TGT GCG TTG 399
Phe Ile Asn Ala Tyr Glu Glu Gly Gln Thr Pro Asn Pro Cys Ala Leu
95 100 105

TGC AAC CCT TTA ATG AAG TTT GGG CTA GCT TTG GAT CAC GCT TTA AAA 447
Cys Asn Pro Leu Met Lys Phe Gly Leu Ala Leu Asp His Ala Leu Lys
110 115 120 125

TTA GGG TGT GAA AAG ATC GCT ACC GGG CAT TAT GCG AGA GTC AAA GAA 495
Leu Gly Cys Glu Lys Ile Ala Thr Gly His Tyr Ala Arg Val Lys Glu
130 135 140

ATT GAC AAA ATA AGT TAT ATT CAA GAG GCT TTG GAT AAA ACT AAA GAT 543
Ile Asp Lys Ile Ser Tyr Ile Gln Glu Ala Leu Asp Lys Thr Lys Asp
145 150 155

CAG AGC TAT TTT TTA TAC GCT TTA GAG CAT GAA GTG ATC GCT AAA TTG 591
Gln Ser Tyr Phe Leu Tyr Ala Leu Glu His Glu Val Ile Ala Lys Leu
160 165 170

GTG TTC CCT TTA GGG GAT TTG CTA AAA AAG GAT ATT AAG CCT TTA GCC 639
Val Phe Pro Leu Gly Asp Leu Leu Lys Lys Asp Ile Lys Pro Leu Ala
175 180 185

TTG AAT GCG ATG CCT TTT TTA GGC ACT TTA GAG ACT TAT AAG GAA TCT 687
Leu Asn Ala Met Pro Phe Leu Gly Thr Leu Glu Thr Tyr Lys Glu Ser
190 195 200 205

CAA GAA ATC TGC TTT GTG GAA AAA AGC TAC ATT GAC ACT TTA AAA AAG 735
Gln Glu Ile Cys Phe Val Glu Lys Ser Tyr Ile Asp Thr Leu Lys Lys
210 215 220

CAT GTT GAA GTG GAA AAA GAG GGC GTG GTG AAA AAC CTA CAA GGC GAA 783
His Val Glu Val Glu Lys Glu Gly Val Val Lys Asn Leu Gln Gly Glu
225 230 235

GTC ATT GGC ACG CAT AAA GGC TAT ATG CAA TAC ACG ATT GGC AAA CGC 831
Val Ile Gly Thr His Lys Gly Tyr Met Gln Tyr Thr Ile Gly Lys Arg
240 245 250

AAA GGC TTT AGT ATT AAA GGC GCG TTA GAG CCG CAT TTT GTG GTG GGG 879
Lys Gly Phe Ser Ile Lys Gly Ala Leu Glu Pro His Phe Val Val Gly
255 260 265

ATT GAC GCT AAA AAG AAC GAG CTA GTC GTG GGC AAA AAA GAA GAT CTC 927
Ile Asp Ala Lys Lys Asn Glu Leu Val Val Gly Lys Lys Glu Asp Leu
270 275 280 285

GCC ACG CAT TCG CTT AAG GCT AAA AAC AAA TCT TTA ATG AAA GAT TTT 975
Ala Thr His Ser Leu Lys Ala Lys Asn Lys Ser Leu Met Lys Asp Phe
290 295 300

AAA GAT GGC GAA TAT TTT ATC AAG GCT CGT TAC AGG AGC GTG CCT GCT 1023
Lys Asp Gly Glu Tyr Phe Ile Lys Ala Arg Tyr Arg Ser Val Pro Ala
305 310 315

AAA GCG CAT GTG AGT TTG AAA GAT GAG GTG ATT GAA GTG GGG TTT AAA 1071
Lys Ala His Val Ser Leu Lys Asp Glu Val Ile Glu Val Gly Phe Lys
320 325 330

GAG CCT TTT TAT GGC GTG GCT AAA GGG CAA GCT TTG GTC GTT TAT AAA 1119
Glu Pro Phe Tyr Gly Val Ala Lys Gly Gln Ala Leu Val Val Tyr Lys
335 340 345

GAT GAC ATC TTG CTT GGT GGG GGC GTG ATT GTT TAAAAACTAA AGAACTAAGA 1172
Asp Asp Ile Leu Leu Gly Gly Gly Val Ile Val
350 355 360

GATACGCCTT TTGGCAGTCT CTTAATGTTT TATTGAATAG GCGTT 1217



(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 360 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

Met Asn Gly Phe Cys Ala Arg Leu Arg Ala Ile Thr His Asn Glu Arg
1 5 10 15

Leu Lys Met Lys Ile Ala Val Leu Leu Ser Gly Gly Val Asp Ser Ser
20 25 30

Tyr Ser Ala Tyr Ser Leu Lys Glu Gln Gly His Glu Leu Val Gly Ile
35 40 45

Tyr Leu Lys Leu His Ala Ser Glu Lys Lys His Asp Leu Tyr Ile Lys
50 55 60

Asn Ala Gln Lys Ala Cys Glu Phe Leu Gly Ile Pro Leu Glu Val Leu
65 70 75 80

Asp Phe Gln Lys Asp Phe Lys Ser Ala Val Tyr Asp Glu Phe Ile Asn
85 90 95

Ala Tyr Glu Glu Gly Gln Thr Pro Asn Pro Cys Ala Leu Cys Asn Pro
100 105 110

Leu Met Lys Phe Gly Leu Ala Leu Asp His Ala Leu Lys Leu Gly Cys
115 120 125

Glu Lys Ile Ala Thr Gly His Tyr Ala Arg Val Lys Glu Ile Asp Lys
130 135 140

Ile Ser Tyr Ile Gln Glu Ala Leu Asp Lys Thr Lys Asp Gln Ser Tyr
145 150 155 160

Phe Leu Tyr Ala Leu Glu His Glu Val Ile Ala Lys Leu Val Phe Pro
165 170 175

Leu Gly Asp Leu Leu Lys Lys Asp Ile Lys Pro Leu Ala Leu Asn Ala
180 185 190

Met Pro Phe Leu Gly Thr Leu Glu Thr Tyr Lys Glu Ser Gln Glu Ile
195 200 205

Cys Phe Val Glu Lys Ser Tyr Ile Asp Thr Leu Lys Lys His Val Glu
210 215 220

Val Glu Lys Glu Gly Val Val Lys Asn Leu Gln Gly Glu Val Ile Gly
225 230 235 240

Thr His Lys Gly Tyr Met Gln Tyr Thr Ile Gly Lys Arg Lys Gly Phe
245 250 255

Ser Ile Lys Gly Ala Leu Glu Pro His Phe Val Val Gly Ile Asp Ala
260 265 270

Lys Lys Asn Glu Leu Val Val Gly Lys Lys Glu Asp Leu Ala Thr His
275 280 285

Ser Leu Lys Ala Lys Asn Lys Ser Leu Met Lys Asp Phe Lys Asp Gly
290 295 300

Glu Tyr Phe Ile Lys Ala Arg Tyr Arg Ser Val Pro Ala Lys Ala His
305 310 315 320

Val Ser Leu Lys Asp Glu Val Ile Glu Val Gly Phe Lys Glu Pro Phe
325 330 335

Tyr Gly Val Ala Lys Gly Gln Ala Leu Val Val Tyr Lys Asp Asp Ile
340 345 350

Leu Leu Gly Gly Gly Val Ile Val
355 360

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 975 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 191...793
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

ACATTACACA TATCTGTCGC TAAAACGCGC CGCTTCACTA AACCCACTGA TTGTAAAAAT 60

TTGTCTATTC GCATGCGTTT ATTTTACCCT ATTCTTTAAG TTTTTATCCA TAACTTATAA 120

GGGTTTTAGT TTTAGCATGT TAGCATTCAG CCACCACTCT TTTTAAGGAA TTTGTTTGAA 180

GTTTCAAATT ATG AGT TTG TTA GCC ACT CTT TTA TTA GCC TCT TGC TTG 229
Met Ser Leu Leu Ala Thr Leu Leu Leu Ala Ser Cys Leu
1 5 10

CCC CCC AAA GGC CAT CAT TCT GGT TTG GTG AAT CTT TAT ATC GCT CAT 277
Pro Pro Lys Gly His His Ser Gly Leu Val Asn Leu Tyr Ile Ala His
15 20 25

CAA GGC CAA AGC GTG CGC ACT TAT TGG CGC AAA GTG GAT AGA GGA GTT 325
Gln Gly Gln Ser Val Arg Thr Tyr Trp Arg Lys Val Asp Arg Gly Val
30 35 40 45

ATC GCT AAA CAC AAT GAA GCG CTT AAA AAA GAT CCT AAA GCA AAG CTC 373
Ile Ala Lys His Asn Glu Ala Leu Lys Lys Asp Pro Lys Ala Lys Leu
50 55 60

AAA GAC CCC AGG GGG CCT TTA TTC ATG CTA GGG AGT GAG CGC TTC ATG 421
Lys Asp Pro Arg Gly Pro Leu Phe Met Leu Gly Ser Glu Arg Phe Met
65 70 75

CTT TTA TGG AAA AAC CGC TAC GCT TTA GCC AAG CCC CAA TCG TTC AGG 469
Leu Leu Trp Lys Asn Arg Tyr Ala Leu Ala Lys Pro Gln Ser Phe Arg
80 85 90

CTA GAG CCT GGT TTT TAT TAC TTG GAT TCT TTT AGC GTG GAA ACT CAA 517
Leu Glu Pro Gly Phe Tyr Tyr Leu Asp Ser Phe Ser Val Glu Thr Gln
95 100 105

AAA GGC GTC TTG CAG AGC GCT CCT GGC TAT TCA TAT ACT AAA AAT GGC 565
Lys Gly Val Leu Gln Ser Ala Pro Gly Tyr Ser Tyr Thr Lys Asn Gly
110 115 120 125

TAT GAT TTC AAA AAC AAC CGC CCC TTT TTC CTG GCC TTT GAA GTC AAA 613
Tyr Asp Phe Lys Asn Asn Arg Pro Phe Phe Leu Ala Phe Glu Val Lys
130 135 140

CCT GAT GGC AAA ACC ATT CTT CCT AGC GTG GAA TTA AGC CTG ATT AAA 661
Pro Asp Gly Lys Thr Ile Leu Pro Ser Val Glu Leu Ser Leu Ile Lys
145 150 155

ACC CCT AGA GGC TTT TTA GGG GTG TTC TTG TTT GAT AAT AAT GAA AAG 709
Thr Pro Arg Gly Phe Leu Gly Val Phe Leu Phe Asp Asn Asn Glu Lys
160 165 170

GGG ACT AAC GCC AAG TGG ATT GAG GGG AGT TTG AAT TTA AAG CTT AAA 757
Gly Thr Asn Ala Lys Trp Ile Glu Gly Ser Leu Asn Leu Lys Leu Lys
175 180 185

AAC GCT TCC TTT AAA GAT GCG TGG GGG TTG GAA CAA TAAAGCATGA AGTGAT 809
Asn Ala Ser Phe Lys Asp Ala Trp Gly Leu Glu Gln
190 195 200

CGCTTGCTTT TCGTAAGCTC TTTATGATTA GATTGTAAAA AAATGCCTTG AGTATTTTTT 869

AGATTTTATT ACCCCTATTC AATTGGAACA AAGCCATTAA ATTTTTAAAA ACTTTTAAAA 929

ACGATAAACA TAATCCGCGC TCCAAGTAAC ATAGCTTTCA AAAATG 975



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 201 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

Met Ser Leu Leu Ala Thr Leu Leu Leu Ala Ser Cys Leu Pro Pro Lys
1 5 10 15

Gly His His Ser Gly Leu Val Asn Leu Tyr Ile Ala His Gln Gly Gln
20 25 30

Ser Val Arg Thr Tyr Trp Arg Lys Val Asp Arg Gly Val Ile Ala Lys
35 40 45

His Asn Glu Ala Leu Lys Lys Asp Pro Lys Ala Lys Leu Lys Asp Pro
50 55 60

Arg Gly Pro Leu Phe Met Leu Gly Ser Glu Arg Phe Met Leu Leu Trp
65 70 75 80

Lys Asn Arg Tyr Ala Leu Ala Lys Pro Gln Ser Phe Arg Leu Glu Pro
85 90 95

Gly Phe Tyr Tyr Leu Asp Ser Phe Ser Val Glu Thr Gln Lys Gly Val
100 105 110

Leu Gln Ser Ala Pro Gly Tyr Ser Tyr Thr Lys Asn Gly Tyr Asp Phe
115 120 125

Lys Asn Asn Arg Pro Phe Phe Leu Ala Phe Glu Val Lys Pro Asp Gly
130 135 140

Lys Thr Ile Leu Pro Ser Val Glu Leu Ser Leu Ile Lys Thr Pro Arg
145 150 155 160

Gly Phe Leu Gly Val Phe Leu Phe Asp Asn Asn Glu Lys Gly Thr Asn
165 170 175

Ala Lys Trp Ile Glu Gly Ser Leu Asn Leu Lys Leu Lys Asn Ala Ser
180 185 190

Phe Lys Asp Ala Trp Gly Leu Glu Gln
195 200

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1116 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 90...1076
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

TATAAATACA TCGTTTCATT AGCGAATTTA ATGGCGTTAA GCGATCATAT TGATTTATTT 60

TATGAATTTG TTTATTAAGG GAAAAAATC ATG TCA AAT AGC ATG TTG GAT AAA 113
Met Ser Asn Ser Met Leu Asp Lys
1 5

AAT AAA GCG ATT CTT ACA GGG GGT GGG GCT TTA TTA TTA GGG CTA ATC 161
Asn Lys Ala Ile Leu Thr Gly Gly Gly Ala Leu Leu Leu Gly Leu Ile
10 15 20

GTG CTT TTT TAT TTA GCT TAT CGC CCT AAG GCT GAA GTG TTG CAA GGG 209
Val Leu Phe Tyr Leu Ala Tyr Arg Pro Lys Ala Glu Val Leu Gln Gly
25 30 35 40

TTT TTG GAA GCC AGA GAA TAC AGC GTG AGC TCC AAA GTC CCT GGC CGC 257
Phe Leu Glu Ala Arg Glu Tyr Ser Val Ser Ser Lys Val Pro Gly Arg
45 50 55

ATT GAA AAG GTG TTT GTT AAA AAA GGC GAT CAC ATT AAA AAG GGC GAT 305
Ile Glu Lys Val Phe Val Lys Lys Gly Asp His Ile Lys Lys Gly Asp
60 65 70

TTG GTT TTT AGC ATT TCT AGC CCT GAA TTA GAA GCC AAA CTC GCT CAA 353
Leu Val Phe Ser Ile Ser Ser Pro Glu Leu Glu Ala Lys Leu Ala Gln
75 80 85

GCT GAA GCC GGG CAT AAA GCC GCT AAA GCG CTT AGC GAT GAA GTC AAA 401
Ala Glu Ala Gly His Lys Ala Ala Lys Ala Leu Ser Asp Glu Val Lys
90 95 100

AGA GGC TCA AGA GAC GAA ACG ATT AAT TCT GCG AGA GAC GTT TGG CAA 449
Arg Gly Ser Arg Asp Glu Thr Ile Asn Ser Ala Arg Asp Val Trp Gln
105 110 115 120

GCA GCC AAA TCC CAA GCC ACT TTA GCC AAA GAG ACT TAT AAG CGC GTT 497
Ala Ala Lys Ser Gln Ala Thr Leu Ala Lys Glu Thr Tyr Lys Arg Val
125 130 135

CAA GAT TTG TAT GAT AAT GGC GTG GCG AGC TTG CAA AAG CGC GAT GAA 545
Gln Asp Leu Tyr Asp Asn Gly Val Ala Ser Leu Gln Lys Arg Asp Glu
140 145 150

GCC TAT GCG GCT TAT GAA AGC ACT AAA TAC AAC GAG AGC GCG GCT TAC 593
Ala Tyr Ala Ala Tyr Glu Ser Thr Lys Tyr Asn Glu Ser Ala Ala Tyr
155 160 165

CAA AAG TAT AAA ATG GCT TTA GGG GGG GCG AGC TCT GAA AGT AAG ATT 641
Gln Lys Tyr Lys Met Ala Leu Gly Gly Ala Ser Ser Glu Ser Lys Ile
170 175 180

GCC GCT AAG GCT AAA GAG AGC GCG GCT TTA GGG CAA GTG AAT GAA GTG 689
Ala Ala Lys Ala Lys Glu Ser Ala Ala Leu Gly Gln Val Asn Glu Val
185 190 195 200

GAG TCT TAT TTA AAA GAC GTC AAA GCG ACA GCC CCA ATT GAT GGG GAA 737
Glu Ser Tyr Leu Lys Asp Val Lys Ala Thr Ala Pro Ile Asp Gly Glu
205 210 215

GTG AGT AAC GTG CTT TTA AGC GGT GGC GAG CTT AGC CCT AAG GGT TTT 785
Val Ser Asn Val Leu Leu Ser Gly Gly Glu Leu Ser Pro Lys Gly Phe
220 225 230

CCT GTG GTT TTA ATG ATA GAT TTA AAG GAT AGT TGG TTA AAA ATC AGC 833
Pro Val Val Leu Met Ile Asp Leu Lys Asp Ser Trp Leu Lys Ile Ser
235 240 245

GTG CCT GAA AAG TAT TTG AAC GAG TTT AAA GTG GGT AAG GAA TTT GAA 881
Val Pro Glu Lys Tyr Leu Asn Glu Phe Lys Val Gly Lys Glu Phe Glu
250 255 260

GGC TAT ATC CCG GCG TTG AAA AAA AGC ACG AAA TTC AGG GTC AAA TAT 929
Gly Tyr Ile Pro Ala Leu Lys Lys Ser Thr Lys Phe Arg Val Lys Tyr
265 270 275 280

TTG AGC GTG ATG GGG GAT TTT GCG ACT TGG AAA GCG ACG AAT AAT TCC 977
Leu Ser Val Met Gly Asp Phe Ala Thr Trp Lys Ala Thr Asn Asn Ser
285 290 295

AAC ACT TAC GAC ATG AAA AGC TAT GAA GTG GAA GCC ATA CCC TTA GAA 1025
Asn Thr Tyr Asp Met Lys Ser Tyr Glu Val Glu Ala Ile Pro Leu Glu
300 305 310

GAG TTG GAA AAT TTT AGG GTA GGG ATG AGC GTG TTA GTT ACC ATT AAA 1073
Glu Leu Glu Asn Phe Arg Val Gly Met Ser Val Leu Val Thr Ile Lys
315 320 325

CCT TAAAAAGGAT TGTTTTGTTC AGATTGATAA GCGCATGGGT 1116
Pro



(2) INFORMATION FOR SEQ ID NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

Met Ser Asn Ser Met Leu Asp Lys Asn Lys Ala Ile Leu Thr Gly Gly

1 5 10 15

Gly Ala Leu Leu Leu Gly Leu Ile Val Leu Phe Tyr Leu Ala Tyr Arg

20 25 30

Pro Lys Ala Glu Val Leu Gln Gly Phe Leu Glu Ala Arg Glu Tyr Ser

35 40 45

Val Ser Ser Lys Val Pro Gly Arg Ile Glu Lys Val Phe Val Lys Lys

50 55 60

Gly Asp His Ile Lys Lys Gly Asp Leu Val Phe Ser Ile Ser Ser Pro
65 70 75 80

Glu Leu Glu Ala Lys Leu Ala Gln Ala Glu Ala Gly His Lys Ala Ala

85 90 95

Lys Ala Leu Ser Asp Glu Val Lys Arg Gly Ser Arg Asp Glu Thr Ile

100 105 110

Asn Ser Ala Arg Asp Val Trp Gln Ala Ala Lys Ser Gln Ala Thr Leu

115 120 125

Ala Lys Glu Thr Tyr Lys Arg Val Gln Asp Leu Tyr Asp Asn Gly Val

130 135 140

Ala Ser Leu Gln Lys Arg Asp Glu Ala Tyr Ala Ala Tyr Glu Ser Thr
145 150 155 160

Lys Tyr Asn Glu Ser Ala Ala Tyr Gln Lys Tyr Lys Met Ala Leu Gly

165 170 175

Gly Ala Ser Ser Glu Ser Lys Ile Ala Ala Lys Ala Lys Glu Ser Ala

180 185 190

Ala Leu Gly Gln Val Asn Glu Val Glu Ser Tyr Leu Lys Asp Val Lys




195 200 205 

Ala Thr Ala Pro Ile Asp Gly Glu Val Ser Asn Val Leu Leu Ser Gly

210 215 220

Gly Glu Leu Ser Pro Lys Gly Phe Pro Val Val Leu Met Ile Asp Leu
225 230 235 240

Lys Asp Ser Trp Leu Lys Ile Ser Val Pro Glu Lys Tyr Leu Asn Glu

245 250 255

Phe Lys Val Gly Lys Glu Phe Glu Gly Tyr Ile Pro Ala Leu Lys Lys

260 265 270

Ser Thr Lys Phe Arg Val Lys Tyr Leu Ser Val Met Gly Asp Phe Ala

275 280 285

Thr Trp Lys Ala Thr Asn Asn Ser Asn Thr Tyr Asp Met Lys Ser Tyr

290 295 300

Glu Val Glu Ala Ile Pro Leu Glu Glu Leu Glu Asn Phe Arg Val Gly
305 310 315 320

Met Ser Val Leu Val Thr Ile Lys Pro
325 
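
Each nucleotide entry above states a coding-sequence LOCATION, and the paired protein entry states a LENGTH in amino acids. The two should agree: the span from start to end, inclusive, divided by three gives the residue count, with the stop codon lying outside the stated range. The Python sketch below works through that arithmetic for SEQ ID NOs 43 to 52, using the coordinates as printed in the listing. The names `LISTED` and `codon_count` are hypothetical, introduced only for this check.

```python
# (nucleotide SEQ ID NO, CDS start, CDS end, protein SEQ ID NO, stated length)
# Coordinates and lengths are copied from the FEATURE and LENGTH fields above.
LISTED = [
    (43, 163, 1389, 44, 409),
    (45, 358, 732, 46, 125),
    (47, 73, 1152, 48, 360),
    (49, 191, 793, 50, 201),
    (51, 90, 1076, 52, 329),
]

def codon_count(start, end):
    """Number of whole codons in the inclusive 1-based range start..end."""
    span = end - start + 1
    if span % 3 != 0:
        raise ValueError("coding range is not a multiple of three")
    return span // 3

for nt_id, start, end, aa_id, stated in LISTED:
    computed = codon_count(start, end)
    status = "agrees" if computed == stated else "differs"
    print(f"SEQ ID NO: {nt_id} -> {aa_id}: {computed} codons, stated {stated} ({status})")
```

A mismatch here would point to a transcription error in either the LOCATION field or the LENGTH field rather than to a problem with the sequence itself.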

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1514 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 94...1467
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:

AAATAAAATA GCGATCATTA TAACATGTTG CTTTTTAAGT GAAAGCGTTA AGTTGTTAGG 60

GTATAGTGGC TTAAAAATTT TAGGATATTG AGA ATG CTT GAA ACT TCT AGC CAT 114
Met Leu Glu Thr Ser Ser His
1 5

TTT TTA AAA TCG TTT CGC TTG AAG CGT TAT ATA GGG TTT TTA TTG ATT 162
Phe Leu Lys Ser Phe Arg Leu Lys Arg Tyr Ile Gly Phe Leu Leu Ile
10 15 20

TCT TTA GCG TTA TTA ATC ACG CCC TTT GTT CGC ATT GAT GGG GCG CAT 210
Ser Leu Ala Leu Leu Ile Thr Pro Phe Val Arg Ile Asp Gly Ala His
25 30 35

TTG TTT TTG ATC TCT TTT GAG CAT AAG CAA CTG CAT TTT TTA GGC AAG 258
Leu Phe Leu Ile Ser Phe Glu His Lys Gln Leu His Phe Leu Gly Lys
40 45 50 55

ATC TTT AGC GCT GAA GAA TTG CAA GTC ATG CCT TTT ATG GTT ATT TTG 306
Ile Phe Ser Ala Glu Glu Leu Gln Val Met Pro Phe Met Val Ile Leu
60 65 70

CTT TTT ATA GGG ATT TTT TTC ATC ACC ACT AGC CTT GGG CGT GTG TGG 354
Leu Phe Ile Gly Ile Phe Phe Ile Thr Thr Ser Leu Gly Arg Val Trp
75 80 85

TGC GGT TGG GCT TGC CCG CAA ACC TTT TTA AGG GTG CTT TAT AGA GAT 402
Cys Gly Trp Ala Cys Pro Gln Thr Phe Leu Arg Val Leu Tyr Arg Asp
90 95 100

GTG ATT GAA ACC AAG ATT TTC AAA CTC CAT AAA AAG ATC AGC AAC AAG 450
Val Ile Glu Thr Lys Ile Phe Lys Leu His Lys Lys Ile Ser Asn Lys
105 110 115

CAA GAA AGC CCT AAA AAC ACC CCA AGC TAC AAG ATC CGT AAA GTA TTG 498
Gln Glu Ser Pro Lys Asn Thr Pro Ser Tyr Lys Ile Arg Lys Val Leu
120 125 130 135

AGC GTT TTA TTG TTC GCT CCT GTT GTG GCG GGG CTA ATG ATG TTG TTT 546
Ser Val Leu Leu Phe Ala Pro Val Val Ala Gly Leu Met Met Leu Phe
140 145 150

TTC TTT TAT TTC ATC GCC CCA GAA GAT TTT TTT ATG TAT CTT AAA AAC 594
Phe Phe Tyr Phe Ile Ala Pro Glu Asp Phe Phe Met Tyr Leu Lys Asn
155 160 165

CCT AGC GAT CAC CCT ATT GCT ATG GGT TTT TGG CTT TTT AGC ACG GCT 642
Pro Ser Asp His Pro Ile Ala Met Gly Phe Trp Leu Phe Ser Thr Ala
170 175 180

GTG GTG CTA TTT GAT ATA GTG GTG GTT GCG GAG CGT TTT TGC ATT TAT 690
Val Val Leu Phe Asp Ile Val Val Val Ala Glu Arg Phe Cys Ile Tyr
185 190 195

TTA TGC CCT TAC GCT AGG GTG CAA TCG GTG TTG TAT GAC AAT GAC ACC 738
Leu Cys Pro Tyr Ala Arg Val Gln Ser Val Leu Tyr Asp Asn Asp Thr
200 205 210 215

TTA AAC CCT ATT TAT GAT GAA AAG CGC GGC GGA GCG CTT TAT AAT AAT 786
Leu Asn Pro Ile Tyr Asp Glu Lys Arg Gly Gly Ala Leu Tyr Asn Asn
220 225 230

CAG GGC CAT CTC TTC CCC TTA CCT CCC AAA AAA CGC AGC CCA GAA AAC 834
Gln Gly His Leu Phe Pro Leu Pro Pro Lys Lys Arg Ser Pro Glu Asn
235 240 245

GAA TGC GTG AAT TGT TTG CAT TGC GTG CAG GTT TGC CCC ACG CAT ATT 882
Glu Cys Val Asn Cys Leu His Cys Val Gln Val Cys Pro Thr His Ile
250 255 260

GAC ATC AGG AAG GGC TTG CAA TTA GAA TGC ATC AAT TGT TTA GAA TGC 930
Asp Ile Arg Lys Gly Leu Gln Leu Glu Cys Ile Asn Cys Leu Glu Cys
265 270 275

GTG GAT GCA TGC ACG ATT ACC ATG GCT AAA TTT AAC CGC CCT TCA CTC 978
Val Asp Ala Cys Thr Ile Thr Met Ala Lys Phe Asn Arg Pro Ser Leu
280 285 290 295

ATC CAA TGG TCT TCA ACT AAC GCT ATT AAT ACG CGC CAA AAA GTG CAC 1026
Ile Gln Trp Ser Ser Thr Asn Ala Ile Asn Thr Arg Gln Lys Val His
300 305 310

CTG GTG CGT TTA AAA ACG ATC GCT TAC ATG GGG GTT ATC GCT ATT GTG 1074
Leu Val Arg Leu Lys Thr Ile Ala Tyr Met Gly Val Ile Ala Ile Val
315 320 325

ATC GCT CTT TTA GCC ATC ACT TCG TTT AAA AAA GAA CGC ATG CTC TTA 1122
Ile Ala Leu Leu Ala Ile Thr Ser Phe Lys Lys Glu Arg Met Leu Leu
330 335 340

GAC ATT AAC CGC AAC AGC GAT CTG TAT GAA TTG CGC TCT AGC GGG TAT 1170
Asp Ile Asn Arg Asn Ser Asp Leu Tyr Glu Leu Arg Ser Ser Gly Tyr
345 350 355

GTG GAT AAC GAT TAC GTG TTT TTA TTC CAC AAC ACG GAC AAT AAA GAC 1218
Val Asp Asn Asp Tyr Val Phe Leu Phe His Asn Thr Asp Asn Lys Asp
360 365 370 375

CAT GAG TTT TAT TTC AAA GTT TTA GGG CAA AAA GAC ATT CAG ATC AAA 1266
His Glu Phe Tyr Phe Lys Val Leu Gly Gln Lys Asp Ile Gln Ile Lys
380 385 390

AAG CCT TTA AAT CCT ATC GCC ATT AAA GCC GGG CAA AAG ATT AAA GCG 1314
Lys Pro Leu Asn Pro Ile Ala Ile Lys Ala Gly Gln Lys Ile Lys Ala
395 400 405

GTA GTG ATT TTA AGA AAA CCC CTA AAG AGT AAC GCC ACA GAA TAC AAG 1362
Val Val Ile Leu Arg Lys Pro Leu Lys Ser Asn Ala Thr Glu Tyr Lys
410 415 420

AAC GCT AAA GAC GCT CTA ATC CCC ATT ACC ATA CAA GCT TAT AGC GCG 1410
Asn Ala Lys Asp Ala Leu Ile Pro Ile Thr Ile Gln Ala Tyr Ser Ala
425 430 435

GAC GAT AAG AAT ATT ACG ATA GAA AGG GAA TCG GTG TTT ATT GCA CCA 1458
Asp Asp Lys Asn Ile Thr Ile Glu Arg Glu Ser Val Phe Ile Ala Pro
440 445 450 455

AGT GAG GAT TGAAGCCTAA AACTAGCGTT CAATCACTTC ATAAGGCAAG CCTTGTT 1514
Ser Glu Asp



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 



Met Leu Glu Thr Ser Ser His Phe Leu Lys Ser Phe Arg Leu Lys Arg
1 5 10 15

Tyr Ile Gly Phe Leu Leu Ile Ser Leu Ala Leu Leu Ile Thr Pro Phe
20 25 30

Val Arg Ile Asp Gly Ala His Leu Phe Leu Ile Ser Phe Glu His Lys
35 40 45

Gln Leu His Phe Leu Gly Lys Ile Phe Ser Ala Glu Glu Leu Gln Val
50 55 60

Met Pro Phe Met Val Ile Leu Leu Phe Ile Gly Ile Phe Phe Ile Thr
65 70 75 80

Thr Ser Leu Gly Arg Val Trp Cys Gly Trp Ala Cys Pro Gln Thr Phe
85 90 95

Leu Arg Val Leu Tyr Arg Asp Val Ile Glu Thr Lys Ile Phe Lys Leu
100 105 110

His Lys Lys Ile Ser Asn Lys Gln Glu Ser Pro Lys Asn Thr Pro Ser
115 120 125

Tyr Lys Ile Arg Lys Val Leu Ser Val Leu Leu Phe Ala Pro Val Val
130 135 140

Ala Gly Leu Met Met Leu Phe Phe Phe Tyr Phe Ile Ala Pro Glu Asp
145 150 155 160

Phe Phe Met Tyr Leu Lys Asn Pro Ser Asp His Pro Ile Ala Met Gly
165 170 175

Phe Trp Leu Phe Ser Thr Ala Val Val Leu Phe Asp Ile Val Val Val
180 185 190

Ala Glu Arg Phe Cys Ile Tyr Leu Cys Pro Tyr Ala Arg Val Gln Ser
195 200 205

Val Leu Tyr Asp Asn Asp Thr Leu Asn Pro Ile Tyr Asp Glu Lys Arg
210 215 220

Gly Gly Ala Leu Tyr Asn Asn Gln Gly His Leu Phe Pro Leu Pro Pro
225 230 235 240

Lys Lys Arg Ser Pro Glu Asn Glu Cys Val Asn Cys Leu His Cys Val
245 250 255

Gln Val Cys Pro Thr His Ile Asp Ile Arg Lys Gly Leu Gln Leu Glu
260 265 270

Cys Ile Asn Cys Leu Glu Cys Val Asp Ala Cys Thr Ile Thr Met Ala
275 280 285

Lys Phe Asn Arg Pro Ser Leu Ile Gln Trp Ser Ser Thr Asn Ala Ile
290 295 300

Asn Thr Arg Gln Lys Val His Leu Val Arg Leu Lys Thr Ile Ala Tyr
305 310 315 320

Met Gly Val Ile Ala Ile Val Ile Ala Leu Leu Ala Ile Thr Ser Phe
325 330 335

Lys Lys Glu Arg Met Leu Leu Asp Ile Asn Arg Asn Ser Asp Leu Tyr
340 345 350

Glu Leu Arg Ser Ser Gly Tyr Val Asp Asn Asp Tyr Val Phe Leu Phe
355 360 365

His Asn Thr Asp Asn Lys Asp His Glu Phe Tyr Phe Lys Val Leu Gly
370 375 380

Gln Lys Asp Ile Gln Ile Lys Lys Pro Leu Asn Pro Ile Ala Ile Lys
385 390 395 400

Ala Gly Gln Lys Ile Lys Ala Val Val Ile Leu Arg Lys Pro Leu Lys
405 410 415

Ser Asn Ala Thr Glu Tyr Lys Asn Ala Lys Asp Ala Leu Ile Pro Ile
420 425 430

Thr Ile Gln Ala Tyr Ser Ala Asp Asp Lys Asn Ile Thr Ile Glu Arg
435 440 445

Glu Ser Val Phe Ile Ala Pro Ser Glu Asp
450 455



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence

(B) LOCATION: 228...782
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

ACGATTTGAT CAATAACGAA AATAAAATTG ATGAAATCAA TAATGAAGAA AACGCTGATC 60

CTTCGCAAAA AAGAACGAAC AACGTTTTGC AACGAGCCAC TAACCACCAA GACAATCTCA 120

ATTCCCCACT CAACAGGAAG TATTAAAGTG TGAAACTTTT TTCAAAGGAT TTATTTAAAA 180

AAGTAACCCC TTTATTTTTA AGCGTTTATT TTTTAAACCC CACCATT ATG CAA GCC 236
Met Gln Ala
1

AAA AGC CGT TTT TAT GTG GCT TCT CAA TAC CAG GTG GGG AAA ATG ATC 284
Lys Ser Arg Phe Tyr Val Ala Ser Gln Tyr Gln Val Gly Lys Met Ile
5 10 15

ATG AAA AAA TAC AAC GAT CTC AAA CGC ACG ATT GAA GGG GCG AGC TTT 332
Met Lys Lys Tyr Asn Asp Leu Lys Arg Thr Ile Glu Gly Ala Ser Phe
20 25 30 35

TCT TTA GGC TGG GAG ATT AAC CCC ACT AAC TAC TGG TTT TAT TCG CGC 380
Ser Leu Gly Trp Glu Ile Asn Pro Thr Asn Tyr Trp Phe Tyr Ser Arg
40 45 50

TAT TAC TTT TTT ATG GAT TAC GGG AAT GTC ATT CTC AAT AAA AGA ACG 428
Tyr Tyr Phe Phe Met Asp Tyr Gly Asn Val Ile Leu Asn Lys Arg Thr
55 60 65

GGC GCT CAA GCG AAC ATG TTC ACT TAT GGC TTT GGG GGG GAT TTG ATT 476
Gly Ala Gln Ala Asn Met Phe Thr Tyr Gly Phe Gly Gly Asp Leu Ile
70 75 80

GTG GAA TAC AAT AAA AAC CCC TTG TAT GTA TTT TCT CTT TTT TAT GGC 524
Val Glu Tyr Asn Lys Asn Pro Leu Tyr Val Phe Ser Leu Phe Tyr Gly
85 90 95

ATG CAA GTT GCT GAA AAC ACA TGG ACG ATT TCC AAA CAC AGC GCG AAT 572
Met Gln Val Ala Glu Asn Thr Trp Thr Ile Ser Lys His Ser Ala Asn
100 105 110 115

TTC ATC ATT GAC GAT TGG CGC AGC ATT CAA GGG TTT TCG CTC AAA ACT 620
Phe Ile Ile Asp Asp Trp Arg Ser Ile Gln Gly Phe Ser Leu Lys Thr
120 125 130

TCC AAT TTT AGG ATG TTG GGT TTA GTG GGG TTT AAA TTC CAA ACC GTG 668
Ser Asn Phe Arg Met Leu Gly Leu Val Gly Phe Lys Phe Gln Thr Val
135 140 145

CTA TTC CAC CAT GAC GCA AGT ATT GAA GTG GGG ATC AAA TGG CCT TTT 716
Leu Phe His His Asp Ala Ser Ile Glu Val Gly Ile Lys Trp Pro Phe
150 155 160

GCT TTT GAA TAC GAC TCA GCC TTT GTA AGG CTT TTT TCT GTC TTT ATT 764
Ala Phe Glu Tyr Asp Ser Ala Phe Val Arg Leu Phe Ser Val Phe Ile
165 170 175

TCG CAC ACT TTC TAC CTT TAAACTAATT CCAACCCTAC CGGGCAATGA TCGCTCCC 820
Ser His Thr Phe Tyr Leu
180 185

TAAAATATCT TTATAGATTA AAGCGTCTTT TAAGCGCGTT TTTAAAGGGT TAGAGCATAA 880

AAAATAATCA ATGCGCCAAC CAATGTTTTT ATCCCTTGCT TGTTGCATGT AACTCCACCA 940

GGTGTAAGCC TTTTCTTTGT TAGGGTAAAA ATAACGGAAA GTGTCAATAA 990

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met Gln Ala Lys Ser Arg Phe Tyr Val Ala Ser Gln Tyr Gln Val Gly
1 5 10 15

Lys Met Ile Met Lys Lys Tyr Asn Asp Leu Lys Arg Thr Ile Glu Gly
20 25 30

Ala Ser Phe Ser Leu Gly Trp Glu Ile Asn Pro Thr Asn Tyr Trp Phe
35 40 45

Tyr Ser Arg Tyr Tyr Phe Phe Met Asp Tyr Gly Asn Val Ile Leu Asn
50 55 60

Lys Arg Thr Gly Ala Gln Ala Asn Met Phe Thr Tyr Gly Phe Gly Gly
65 70 75 80

Asp Leu Ile Val Glu Tyr Asn Lys Asn Pro Leu Tyr Val Phe Ser Leu
85 90 95

Phe Tyr Gly Met Gln Val Ala Glu Asn Thr Trp Thr Ile Ser Lys His
100 105 110

Ser Ala Asn Phe Ile Ile Asp Asp Trp Arg Ser Ile Gln Gly Phe Ser
115 120 125

Leu Lys Thr Ser Asn Phe Arg Met Leu Gly Leu Val Gly Phe Lys Phe
130 135 140

Gln Thr Val Leu Phe His His Asp Ala Ser Ile Glu Val Gly Ile Lys
145 150 155 160

Trp Pro Phe Ala Phe Glu Tyr Asp Ser Ala Phe Val Arg Leu Phe Ser
165 170 175

Val Phe Ile Ser His Thr Phe Tyr Leu
180 185

(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 109...1113
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

ATCTTACCTT TATCTTTTAA GATTTTATGA AAAATAGTTT CATTTTTACT ATTGTTATTT 60

TCTTAGTAAT GTTATAATCG CTTTATAAAT CATACAAAAA GGATCGCT ATG TTA GTT 117
Met Leu Val
1

ACT CGC TTT AAA AAA GCT TTC ATT TCT TAT TCT TTA GGC GTG CTT GTC 165
Thr Arg Phe Lys Lys Ala Phe Ile Ser Tyr Ser Leu Gly Val Leu Val
5 10 15

GCT TCA TTA TGG TTG AAC GTG TGC AAC GCT TCA GCG CAA GAA GTC AAA 213
Ala Ser Leu Trp Leu Asn Val Cys Asn Ala Ser Ala Gln Glu Val Lys
20 25 30 35

GTC AAG GAT TAT TTC GGG GAG CAA ACC ATC AAG CTT CCT GTT TCT AAA 261
Val Lys Asp Tyr Phe Gly Glu Gln Thr Ile Lys Leu Pro Val Ser Lys
40 45 50

ATA GCC TAT ATA GGG AGC TAT GTA GAA GTG CCT GCC ATG CTT AAT GTT 309
Ile Ala Tyr Ile Gly Ser Tyr Val Glu Val Pro Ala Met Leu Asn Val
55 60 65

TGG AAT AGG GTT GTA GGC GTT TCG GAT TAC GCT TTT AAA GAC GAT ATT 357
Trp Asn Arg Val Val Gly Val Ser Asp Tyr Ala Phe Lys Asp Asp Ile
70 75 80

GTC AAA GCC ACT CTC AAA GGC GAA GAT CTT AAA CGC GTC AAA CAC ATG 405
Val Lys Ala Thr Leu Lys Gly Glu Asp Leu Lys Arg Val Lys His Met
85 90 95

AGC ACT GAT CAT ACA GCC GCG CTA AAT GTA GAG CTT TTA AAA AAG CTT 453
Ser Thr Asp His Thr Ala Ala Leu Asn Val Glu Leu Leu Lys Lys Leu
100 105 110 115

AGC CCT GAT CTT GTG GTA ACC TTT GTG GGC AAC CCT AAA GCG GTA GAG 501
Ser Pro Asp Leu Val Val Thr Phe Val Gly Asn Pro Lys Ala Val Glu
120 125 130

CAT GCG AAA AAA TTT GGT ATA TCA TTT CTT TCT TTT CAA GAG ACA ACG 549
His Ala Lys Lys Phe Gly Ile Ser Phe Leu Ser Phe Gln Glu Thr Thr
135 140 145

ATT GCA GAG GCC ATG CAG GCC ATG CAA GCT CAA GCC ACG GTT TTA GAG 597
Ile Ala Glu Ala Met Gln Ala Met Gln Ala Gln Ala Thr Val Leu Glu
150 155 160

ATT GAC GCT TCC AAA AAA TTC GCC AAA ATG CAA GAA ACT TTG GAT TTT 645
Ile Asp Ala Ser Lys Lys Phe Ala Lys Met Gln Glu Thr Leu Asp Phe
165 170 175

ATT GCT GAG CGT TTG AAA AAT GTC AAA AAG AAA AAG GGG GTG GAG CTT 693
Ile Ala Glu Arg Leu Lys Asn Val Lys Lys Lys Lys Gly Val Glu Leu
180 185 190 195

TTC CAT AAA GCC AAT AAA ATC AGC GGC CAT CAA GCC ATT AGC TCA GAC 741
Phe His Lys Ala Asn Lys Ile Ser Gly His Gln Ala Ile Ser Ser Asp
200 205 210

ATT TTA GAA AAA GGG GGC ATA GAC AAT TTT GGC TTG AAA TAT GTC AAA 789
Ile Leu Glu Lys Gly Gly Ile Asp Asn Phe Gly Leu Lys Tyr Val Lys
215 220 225

TTT GGG CGT GCT GAC ATT AGC GTG GAA AAA ATC GTT AAA GAA AAC CCT 837
Phe Gly Arg Ala Asp Ile Ser Val Glu Lys Ile Val Lys Glu Asn Pro
230 235 240

GAG ATT ATC TTT ATT TGG TGG ATA AGC CCA CTC ACG CCT GAA GAT GTG 885
Glu Ile Ile Phe Ile Trp Trp Ile Ser Pro Leu Thr Pro Glu Asp Val
245 250 255

TTA AAC AAC CCC AAA TTT GCT ACC ATC AAA GCC ATT AAA AAC AAG CAG 933
Leu Asn Asn Pro Lys Phe Ala Thr Ile Lys Ala Ile Lys Asn Lys Gln
260 265 270 275

GTT TAT AAA CTC CCC ACA ATG GAT ATT GGC GGG CCT AGA GCC CCA CTC 981
Val Tyr Lys Leu Pro Thr Met Asp Ile Gly Gly Pro Arg Ala Pro Leu
280 285 290

ATA AGT CTT TTT ATC GCT CTA AAA GCC CAC CCT GAA GCC TTT AAG GGC 1029
Ile Ser Leu Phe Ile Ala Leu Lys Ala His Pro Glu Ala Phe Lys Gly
295 300 305

GTG GAT ATT AAT GCG ATG GTT AAA GAC TAC TAT AAA GTG GTT TTT GAT 1077
Val Asp Ile Asn Ala Met Val Lys Asp Tyr Tyr Lys Val Val Phe Asp
310 315 320

TTG AAT GAT GCA GAG GTT GAG CCC TTT TTA TGG CAT TAATTTTTAA AAAGGG 1129
Leu Asn Asp Ala Glu Val Glu Pro Phe Leu Trp His
325 330 335

GTTGATGTTT TTAGCCTTTC GTGTATCGCG CT 1161



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 335 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



Met Leu Val Thr Arg Phe Lys Lys Ala Phe Ile Ser Tyr Ser Leu Gly
1 5 10 15

Val Leu Val Ala Ser Leu Trp Leu Asn Val Cys Asn Ala Ser Ala Gln
20 25 30

Glu Val Lys Val Lys Asp Tyr Phe Gly Glu Gln Thr Ile Lys Leu Pro
35 40 45

Val Ser Lys Ile Ala Tyr Ile Gly Ser Tyr Val Glu Val Pro Ala Met
50 55 60

Leu Asn Val Trp Asn Arg Val Val Gly Val Ser Asp Tyr Ala Phe Lys
65 70 75 80

Asp Asp Ile Val Lys Ala Thr Leu Lys Gly Glu Asp Leu Lys Arg Val
85 90 95

Lys His Met Ser Thr Asp His Thr Ala Ala Leu Asn Val Glu Leu Leu
100 105 110

Lys Lys Leu Ser Pro Asp Leu Val Val Thr Phe Val Gly Asn Pro Lys
115 120 125

Ala Val Glu His Ala Lys Lys Phe Gly Ile Ser Phe Leu Ser Phe Gln
130 135 140

Glu Thr Thr Ile Ala Glu Ala Met Gln Ala Met Gln Ala Gln Ala Thr
145 150 155 160

Val Leu Glu Ile Asp Ala Ser Lys Lys Phe Ala Lys Met Gln Glu Thr
165 170 175

Leu Asp Phe Ile Ala Glu Arg Leu Lys Asn Val Lys Lys Lys Lys Gly
180 185 190

Val Glu Leu Phe His Lys Ala Asn Lys Ile Ser Gly His Gln Ala Ile
195 200 205

Ser Ser Asp Ile Leu Glu Lys Gly Gly Ile Asp Asn Phe Gly Leu Lys
210 215 220

Tyr Val Lys Phe Gly Arg Ala Asp Ile Ser Val Glu Lys Ile Val Lys
225 230 235 240

Glu Asn Pro Glu Ile Ile Phe Ile Trp Trp Ile Ser Pro Leu Thr Pro
245 250 255

Glu Asp Val Leu Asn Asn Pro Lys Phe Ala Thr Ile Lys Ala Ile Lys
260 265 270

Asn Lys Gln Val Tyr Lys Leu Pro Thr Met Asp Ile Gly Gly Pro Arg
275 280 285

Ala Pro Leu Ile Ser Leu Phe Ile Ala Leu Lys Ala His Pro Glu Ala
290 295 300

Phe Lys Gly Val Asp Ile Asn Ala Met Val Lys Asp Tyr Tyr Lys Val
305 310 315 320

Val Phe Asp Leu Asn Asp Ala Glu Val Glu Pro Phe Leu Trp His
325 330 335

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 121...669
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

TTATTCGCAT GCATTAGCTA TTATTGAAGC TCAAAGCATT CAAGCGCATT TATTCTTAGA 60

TGAAATCAAA CAAAGCCAAA AAGAAAAGAA AAAATTCCCC ACTTTCAAAG GAGGTTTTTA 120

ATG CGT TGG TGG TGT TTT TTG GTG TGT TGT TTT GGT ATT TTA AGC GTG 168
Met Arg Trp Trp Cys Phe Leu Val Cys Cys Phe Gly Ile Leu Ser Val
1 5 10 15

ATG GAC GCT AAA AAA TTA GAG AAT AAG AAT TTG AAA AAA GAA AGA GAG 216
Met Asp Ala Lys Lys Leu Glu Asn Lys Asn Leu Lys Lys Glu Arg Glu
20 25 30

CTT TTA GAG ATT ACT GGC AAC CAA TTT GTA GCG AAC GAC AAA ACC AAA 264
Leu Leu Glu Ile Thr Gly Asn Gln Phe Val Ala Asn Asp Lys Thr Lys
35 40 45

ACC GCT GTT ATT CAA GGC AAT GTG CAG ATC AAA AAG GGT AAA GAC CGG 312
Thr Ala Val Ile Gln Gly Asn Val Gln Ile Lys Lys Gly Lys Asp Arg
50 55 60

TTG TTT GCG GAC AAG GTG AGC GTG TTT TTA AAC GAT AAA CGA AAG CCA 360
Leu Phe Ala Asp Lys Val Ser Val Phe Leu Asn Asp Lys Arg Lys Pro
65 70 75 80

GAG CGC TAT GAA GCC ACA GGG AAC ACG CAT TTT AAC ATC TTT ACA GAG 408
Glu Arg Tyr Glu Ala Thr Gly Asn Thr His Phe Asn Ile Phe Thr Glu
85 90 95

GAC AAT CGT GAA ATC AGC GGG AGT GCT GAC AAG CTC ATT TAT AAC GCG 456
Asp Asn Arg Glu Ile Ser Gly Ser Ala Asp Lys Leu Ile Tyr Asn Ala
100 105 110

CTG AAT GGG GAA TAC AAA TTA TTG CAA AAT GCG GTG GTT AGA GAA GTG 504
Leu Asn Gly Glu Tyr Lys Leu Leu Gln Asn Ala Val Val Arg Glu Val
115 120 125

GGG AAA TCC AAT GTC ATC ACC GGC GAT GAA ATC ATT TTA AAC AAA ACT 552
Gly Lys Ser Asn Val Ile Thr Gly Asp Glu Ile Ile Leu Asn Lys Thr
130 135 140

AAG GGT TAT GCT GAT GTG TTG GGG AGC GCG AAA CGG CCC GCT AAA TTC 600
Lys Gly Tyr Ala Asp Val Leu Gly Ser Ala Lys Arg Pro Ala Lys Phe
145 150 155 160

GTG TTT GAT ATG GAA GAT ATT AAT GAA GAA AAT CGT AAG GCT AAA TTG 648
Val Phe Asp Met Glu Asp Ile Asn Glu Glu Asn Arg Lys Ala Lys Leu
165 170 175

AAG AAG AAA GGC GAA AAA CCA TGATTGTCAT TAAAGACGCT CATTTTCTCA CTTC 703
Lys Lys Lys Gly Glu Lys Pro
180

TTCAAGCCAA CTTTTTCAAT GCCCTGCGAG TTTGACTTCT GAAATGGTGG TTTTAGGGCG 763

CAGCAATGTA GGCAAAAGCT CGTTTATTAA TACCTTG 800



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE : internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

Met Arg Trp Trp Cys Phe Leu Val Cys Cys Phe Gly Ile Leu Ser Val
1 5 10 15

Met Asp Ala Lys Lys Leu Glu Asn Lys Asn Leu Lys Lys Glu Arg Glu
20 25 30

Leu Leu Glu Ile Thr Gly Asn Gln Phe Val Ala Asn Asp Lys Thr Lys
35 40 45

Thr Ala Val Ile Gln Gly Asn Val Gln Ile Lys Lys Gly Lys Asp Arg
50 55 60

Leu Phe Ala Asp Lys Val Ser Val Phe Leu Asn Asp Lys Arg Lys Pro
65 70 75 80

Glu Arg Tyr Glu Ala Thr Gly Asn Thr His Phe Asn Ile Phe Thr Glu
85 90 95

Asp Asn Arg Glu Ile Ser Gly Ser Ala Asp Lys Leu Ile Tyr Asn Ala
100 105 110

Leu Asn Gly Glu Tyr Lys Leu Leu Gln Asn Ala Val Val Arg Glu Val
115 120 125

Gly Lys Ser Asn Val Ile Thr Gly Asp Glu Ile Ile Leu Asn Lys Thr
130 135 140

Lys Gly Tyr Ala Asp Val Leu Gly Ser Ala Lys Arg Pro Ala Lys Phe
145 150 155 160

Val Phe Asp Met Glu Asp Ile Asn Glu Glu Asn Arg Lys Ala Lys Leu
165 170 175

Lys Lys Lys Gly Glu Lys Pro
180



(2) INFORMATION FOR SEQ ID NO: 61:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 724 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 88...618
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



GTGATGATTG AAAACATGTG AAAGAGCGTT TTTTAAGCTT TTAAATGGTG TTTGAATGCG 60

AAAAAAAGGC TAATACTATC ATAAGGA ATG AAG TTG ATA AAA TTT GTG CGT AAT 114
Met Lys Leu Ile Lys Phe Val Arg Asn
1 5

GTG GTT TTG TTC ATT TTA ACG GCG ATC TTT TTA GCG TTC ATG CTT TTG 162
Val Val Leu Phe Ile Leu Thr Ala Ile Phe Leu Ala Phe Met Leu Leu
10 15 20 25

GTG AGT TAT TGC ATG CCC CAT TAT AGC GCG GCT GTC ATT AGC GGG GTG 210
Val Ser Tyr Cys Met Pro His Tyr Ser Ala Ala Val Ile Ser Gly Val
30 35 40

GAA GTC AAA AGA ATG AAT GAA AAT GAA AAC ACG CCC AAT AAT AAG GAA 258
Glu Val Lys Arg Met Asn Glu Asn Glu Asn Thr Pro Asn Asn Lys Glu
45 50 55

GTA AAA ACC CTT GCT AGA GAT GTC TAT TTT GTG CAA ACT TAC GAC CCT 306
Val Lys Thr Leu Ala Arg Asp Val Tyr Phe Val Gln Thr Tyr Asp Pro
60 65 70

AAA GAT CAA AAA AGC GTA ACC GTT TAT CGT AAC GAA GAC ACG CGC TTT 354
Lys Asp Gln Lys Ser Val Thr Val Tyr Arg Asn Glu Asp Thr Arg Phe
75 80 85

AGC TTC CCT TTT TAT TTT AAG TTT AAT TCG GCT GAT ATT TCA GCC CTC 402
Ser Phe Pro Phe Tyr Phe Lys Phe Asn Ser Ala Asp Ile Ser Ala Leu
90 95 100 105

GCT CAA AGT TTA ATC AAT CAG CAA GTG GAA GTG AAA TAC TAT GGT TGG 450
Ala Gln Ser Leu Ile Asn Gln Gln Val Glu Val Lys Tyr Tyr Gly Trp
110 115 120

CGG ATC AAT TTG TTT AAC ATG TTC CCT AAT GTG ATT TTT TTA AAG CCC 498
Arg Ile Asn Leu Phe Asn Met Phe Pro Asn Val Ile Phe Leu Lys Pro
125 130 135

TTA AAA GAG AGC ACT GAC ATT TCA AAG CCC ATT TTT AGC TGG ATT TTA 546
Leu Lys Glu Ser Thr Asp Ile Ser Lys Pro Ile Phe Ser Trp Ile Leu
140 145 150

TAC GCT TTG CTG TTA ATG GGC TTT TTT ATC AGC GCG CGT TCT GTT TGC 594
Tyr Ala Leu Leu Leu Met Gly Phe Phe Ile Ser Ala Arg Ser Val Cys
155 160 165

ACT TTA TTT AAG AGC AAA GCT CAT TAAAACTTTT AGGCTTTGTT GGAAAATCAC 648
Thr Leu Phe Lys Ser Lys Ala His
170 175

AATGGGGTTA TTGGAGCGTG TATTAAAAAG CTCAATATAG GGCAAGCTGA TGCTGTGAAA 708
AGCGGTGTTG TTTCCT 724

(2) INFORMATION FOR SEQ ID NO: 62:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Met Lys Leu Ile Lys Phe Val Arg Asn Val Val Leu Phe Ile Leu Thr
1 5 10 15

Ala Ile Phe Leu Ala Phe Met Leu Leu Val Ser Tyr Cys Met Pro His
20 25 30

Tyr Ser Ala Ala Val Ile Ser Gly Val Glu Val Lys Arg Met Asn Glu
35 40 45

Asn Glu Asn Thr Pro Asn Asn Lys Glu Val Lys Thr Leu Ala Arg Asp
50 55 60

Val Tyr Phe Val Gln Thr Tyr Asp Pro Lys Asp Gln Lys Ser Val Thr
65 70 75 80

Val Tyr Arg Asn Glu Asp Thr Arg Phe Ser Phe Pro Phe Tyr Phe Lys
85 90 95

Phe Asn Ser Ala Asp Ile Ser Ala Leu Ala Gln Ser Leu Ile Asn Gln
100 105 110

Gln Val Glu Val Lys Tyr Tyr Gly Trp Arg Ile Asn Leu Phe Asn Met
115 120 125

Phe Pro Asn Val Ile Phe Leu Lys Pro Leu Lys Glu Ser Thr Asp Ile
130 135 140

Ser Lys Pro Ile Phe Ser Trp Ile Leu Tyr Ala Leu Leu Leu Met Gly
145 150 155 160

Phe Phe Ile Ser Ala Arg Ser Val Cys Thr Leu Phe Lys Ser Lys Ala
165 170 175

His



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 117...911
(D) OTHER INFORMATION:



(A) NAME/KEY: sig_peptide

(B) LOCATION: 117...167
(D) OTHER INFORMATION:



(A) NAME/KEY: mat_peptide

(B) LOCATION: 168...911
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CCACATTTAA GGTAGAAACC ACTCAATTAG ATGTAAAAAT TCCAAACGGC AACCAAAAAA 60

TGGTTAAAAA GGACACAATA AACCCCAAAA ATGAAATTTA AATATATGGG AACTTA ATG 119
Met
-17

AGA ATT TTT TTT GTT ATC ATG GGA CTT GTG TTT TTT GGT TGC ACC AGT 167
Arg Ile Phe Phe Val Ile Met Gly Leu Val Phe Phe Gly Cys Thr Ser
-15 -10 -5

AAG GTG CAT GAG ATG AAA AAA AGC CCT TGC ACC TTG TAT GAA AAC AGG 215
Lys Val His Glu Met Lys Lys Ser Pro Cys Thr Leu Tyr Glu Asn Arg
1 5 10 15

TTA AAT CTC GCA GAA ATC TTT CAC AAG CGA GCA ATT GAT CTA TTT AGA 263
Leu Asn Leu Ala Glu Ile Phe His Lys Arg Ala Ile Asp Leu Phe Arg
20 25 30

GAG CTT TTA AGC CAC CAA GAA AAG CAT TTA GAA AAC AAG CTT TCT GGT 311
Glu Leu Leu Ser His Gln Glu Lys His Leu Glu Asn Lys Leu Ser Gly
35 40 45

TTT TCG GTG AGT GAT TTG GAC ATG CAA AGC GTG TTT CGG CTG GAA AGA 359
Phe Ser Val Ser Asp Leu Asp Met Gln Ser Val Phe Arg Leu Glu Arg
50 55 60

AAC CGC TTG AAA ATC GCT TAC AAG CTC TTA GGC TTG ATG AGT TTT ATC 407
Asn Arg Leu Lys Ile Ala Tyr Lys Leu Leu Gly Leu Met Ser Phe Ile
65 70 75 80

GCT CTT ATT TTA GCG ATC GTG TTA ATC AGT CTT CTA CCC TTA CAA AAA 455
Ala Leu Ile Leu Ala Ile Val Leu Ile Ser Leu Leu Pro Leu Gln Lys
85 90 95

ACC GAA CAC CAT TTC GTG GAT TTT TTA AAC CAG GAC AAG CAT TAC GTC 503
Thr Glu His His Phe Val Asp Phe Leu Asn Gln Asp Lys His Tyr Val
100 105 110

ATT ATC CAA AGA GCG GAT AAA AGC ATT TCC AGT AAT GAA GCG TTG GCT 551
Ile Ile Gln Arg Ala Asp Lys Ser Ile Ser Ser Asn Glu Ala Leu Ala
115 120 125

CGT TCG CTC ATT GGG GCG TAT GTG TTA AAC CGA GAG AGC ATT AAC CGC 599
Arg Ser Leu Ile Gly Ala Tyr Val Leu Asn Arg Glu Ser Ile Asn Arg
130 135 140

ATT GAC GAT AAA TCG CGC TAT GAA TTG GTG CGC TTG CAA AGC AGT TCT 647
Ile Asp Asp Lys Ser Arg Tyr Glu Leu Val Arg Leu Gln Ser Ser Ser
145 150 155 160

AAA GTG TGG CAA CGC TTT GAA GAT TTG ATT AAA ACC CAA AAC AGC ATT 695
Lys Val Trp Gln Arg Phe Glu Asp Leu Ile Lys Thr Gln Asn Ser Ile
165 170 175

TAT GTG CAA AGC CAT TTG GAA AGA GAA GTC CAT ATC GTC AAT ATT GCG 743
Tyr Val Gln Ser His Leu Glu Arg Glu Val His Ile Val Asn Ile Ala
180 185 190

ATC TAT CAG CAA GAC AAT AAC CCC ATT GCG AGC GTC TCC ATT GCC GCT 791
Ile Tyr Gln Gln Asp Asn Asn Pro Ile Ala Ser Val Ser Ile Ala Ala
195 200 205

AAA CTT TTG AAT GAA AAC AAG CTG GTG TAT GAA AAG CGT TAT AAA ATC 839
Lys Leu Leu Asn Glu Asn Lys Leu Val Tyr Glu Lys Arg Tyr Lys Ile
210 215 220

GTA TTG AGT TAT TTG TTT GAC ACC CCG ATG AAT TCA AGC TTG CAA GCT 887
Val Leu Ser Tyr Leu Phe Asp Thr Pro Met Asn Ser Ser Leu Gln Ala
225 230 235 240

TGC AAG CTC TCA GGC TTC ATA GTT TGACATGACA TATAGATGAG CTTTATGCGG 941
Cys Lys Leu Ser Gly Phe Ile Val
245

TACGATTATC ACAGAATGGC TAACGCAGCA GGCACCGAGT A 982

(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:

Met Arg Ile Phe Phe Val Ile Met Gly Leu Val Phe Phe Gly Cys Thr
-17 -15 -10 -5

Ser Lys Val His Glu Met Lys Lys Ser Pro Cys Thr Leu Tyr Glu Asn
1 5 10 15

Arg Leu Asn Leu Ala Glu Ile Phe His Lys Arg Ala Ile Asp Leu Phe
20 25 30

Arg Glu Leu Leu Ser His Gln Glu Lys His Leu Glu Asn Lys Leu Ser
35 40 45

Gly Phe Ser Val Ser Asp Leu Asp Met Gln Ser Val Phe Arg Leu Glu
50 55 60

Arg Asn Arg Leu Lys Ile Ala Tyr Lys Leu Leu Gly Leu Met Ser Phe
65 70 75

Ile Ala Leu Ile Leu Ala Ile Val Leu Ile Ser Leu Leu Pro Leu Gln
80 85 90 95

Lys Thr Glu His His Phe Val Asp Phe Leu Asn Gln Asp Lys His Tyr
100 105 110

Val Ile Ile Gln Arg Ala Asp Lys Ser Ile Ser Ser Asn Glu Ala Leu
115 120 125

Ala Arg Ser Leu Ile Gly Ala Tyr Val Leu Asn Arg Glu Ser Ile Asn
130 135 140

Arg Ile Asp Asp Lys Ser Arg Tyr Glu Leu Val Arg Leu Gln Ser Ser
145 150 155

Ser Lys Val Trp Gln Arg Phe Glu Asp Leu Ile Lys Thr Gln Asn Ser
160 165 170 175

Ile Tyr Val Gln Ser His Leu Glu Arg Glu Val His Ile Val Asn Ile
180 185 190

Ala Ile Tyr Gln Gln Asp Asn Asn Pro Ile Ala Ser Val Ser Ile Ala
195 200 205

Ala Lys Leu Leu Asn Glu Asn Lys Leu Val Tyr Glu Lys Arg Tyr Lys
210 215 220

Ile Val Leu Ser Tyr Leu Phe Asp Thr Pro Met Asn Ser Ser Leu Gln
225 230 235

Ala Cys Lys Leu Ser Gly Phe Ile Val
240 245



(2) INFORMATION FOR SEQ ID NO:65: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2059 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 183...1961
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

GATATTTGTT TTGTTGGGGG TTAGGTTTTT GTTTAAGAAA GTTTTTTAAA ACTAAAGAAG 60

CGCTTAAAAC AGAACCTTTT GTTTTTTAGG TTTTATTTTT TACTTTGGCT TGTTTTCAAA 120

AGTCATTTTG ATTTCTAAAA ATAGTCTATA ATGCTCGCAA GAGATATTTT TTAAGGTTAT 180

CA ATG AAA GCT ATA AAA ATA CTT TTT ATA ATG ACA CTC AGT TTA AAC 227
Met Lys Ala Ile Lys Ile Leu Phe Ile Met Thr Leu Ser Leu Asn
1 5 10 15

GCT ATC AGC GTG AAT AGG GCG TTG TTT GAT TTA AAA GAT TCG CAA TTA 275
Ala Ile Ser Val Asn Arg Ala Leu Phe Asp Leu Lys Asp Ser Gln Leu
20 25 30

AAA GGG GAA TTA ACG CCA AAA ATA GTG AAT TTT GGG GGT TAT AAA AGC 323
Lys Gly Glu Leu Thr Pro Lys Ile Val Asn Phe Gly Gly Tyr Lys Ser
35 40 45

AGC ACT GAA GAG TGG GGG GCT ACG GCT TTA AAC TAT ATC AAT GCG GCT 371
Ser Thr Glu Glu Trp Gly Ala Thr Ala Leu Asn Tyr Ile Asn Ala Ala
50 55 60

AAT GGC GAT GCG AAA AAA TTC AGC ACT CTA GTG GAA AAA ATG CGT TTT 419
Asn Gly Asp Ala Lys Lys Phe Ser Thr Leu Val Glu Lys Met Arg Phe
65 70 75

AAC TCC GGT ATA TTG GGG AAT TTA AGA GTG CAT GCA CGT TTG AGG CAA 467
Asn Ser Gly Ile Leu Gly Asn Leu Arg Val His Ala Arg Leu Arg Gln
80 85 90 95

GCC CTA AAA TTG CAA AAG AAT TTG AAA TAT TGC CTT AAA ATC ATC GCT 515
Ala Leu Lys Leu Gln Lys Asn Leu Lys Tyr Cys Leu Lys Ile Ile Ala
100 105 110

AGG GAT TCT TTT TAT AGC TAC CGC ACC GGT ATT TAT ATC CCC TTA GGC 563
Arg Asp Ser Phe Tyr Ser Tyr Arg Thr Gly Ile Tyr Ile Pro Leu Gly
115 120 125

ATT TCT TTA AAA GAT CAA AAA ACG GCT CAA AAA ATG CTC GCT GAT TTG 611
Ile Ser Leu Lys Asp Gln Lys Thr Ala Gln Lys Met Leu Ala Asp Leu
130 135 140

AGC GTG GTA GGG GCG TAT CTT AAA AAA CAA CAA GAG AAT GAA AAG GCT 659
Ser Val Val Gly Ala Tyr Leu Lys Lys Gln Gln Glu Asn Glu Lys Ala
145 150 155

CAA AGC CCT TAT TAC AGA AAC AAC AAC TAT TAC AAC TCT TAC TAT AGC 707
Gln Ser Pro Tyr Tyr Arg Asn Asn Asn Tyr Tyr Asn Ser Tyr Tyr Ser
160 165 170 175

CCT TAT TAC GGA ATG TAT GGT ATG TAT GGC ATG GGC ATG TAT GGA ATG 755
Pro Tyr Tyr Gly Met Tyr Gly Met Tyr Gly Met Gly Met Tyr Gly Met
180 185 190

TAT GGC ATG GGC ATG TAT GAT TTT TAT GAC TTT TAT GAT GGC ATG TAT 803
Tyr Gly Met Gly Met Tyr Asp Phe Tyr Asp Phe Tyr Asp Gly Met Tyr
195 200 205

GGA TTC TAC CCT AAC ATG TTT TTC ATG ATG CAA GTT CAA GAT TAC TTG 851
Gly Phe Tyr Pro Asn Met Phe Phe Met Met Gln Val Gln Asp Tyr Leu
210 215 220

ATG TTA GAA AAT TAC ATG TAT GCG CTC GAT CAA GAA GAG ATT TTA GAT 899
Met Leu Glu Asn Tyr Met Tyr Ala Leu Asp Gln Glu Glu Ile Leu Asp
225 230 235

CAT GAC GCT TCT ACT GAC CAA CTT GAT ACG CCT ACT GAT GAT GAC AAA 947
His Asp Ala Ser Thr Asp Gln Leu Asp Thr Pro Thr Asp Asp Asp Lys
240 245 250 255

GAC GAT AAA GAC GAT AAA TCC TTA CAG CAG GCA AAT CTT ATG AAC TTT 995
Asp Asp Lys Asp Asp Lys Ser Leu Gln Gln Ala Asn Leu Met Asn Phe
260 265 270

TAT CGT GAT CCC AAA TTC AGC AAA GGC ATT CAA ACC AAC CGC TTG AAT 1043
Tyr Arg Asp Pro Lys Phe Ser Lys Gly Ile Gln Thr Asn Arg Leu Asn
275 280 285

AGC GCT TTA GTC AAT TTA GAC AAC AGT CGC ATG CTC AAA GAC AAT TCG 1091
Ser Ala Leu Val Asn Leu Asp Asn Ser Arg Met Leu Lys Asp Asn Ser
290 295 300

CTT TTC CAC ACT AAA GCC ATG CCC ACT AAA AGC GTG GAT GCG ATA ACT 1139
Leu Phe His Thr Lys Ala Met Pro Thr Lys Ser Val Asp Ala Ile Thr
305 310 315

TCT CAA GCC AAA GAG CTT AAC CAT TTA GTG GGG CAA ATC AAA GAA ATG 1187
Ser Gln Ala Lys Glu Leu Asn His Leu Val Gly Gln Ile Lys Glu Met
320 325 330 335

AAG CAA GAC GGG GCG AGT CCT AGT AAG ATT GAT TCA GTT GTC AAT AAA 1235
Lys Gln Asp Gly Ala Ser Pro Ser Lys Ile Asp Ser Val Val Asn Lys
340 345 350

GCT ATG GAA GTG AGG GAC AAG CTA GAC AAT AAT CTC AAC CAA CTA GAC 1283
Ala Met Glu Val Arg Asp Lys Leu Asp Asn Asn Leu Asn Gln Leu Asp
355 360 365

AAT GAC TTA AAA GAT CAA AAA GGG CTT TCA AGC GAG CAA CAA GCT CAA 1331
Asn Asp Leu Lys Asp Gln Lys Gly Leu Ser Ser Glu Gln Gln Ala Gln
370 375 380

GTG GAT AAA GCC CTA GAG AGC GTG CAA CAA TTA AGC CAT AGC AGC GAT 1379
Val Asp Lys Ala Leu Asp Ser Val Gln Gln Leu Ser His Ser Ser Asp
385 390 395

GTG GTG GGG AAT TAT TTA GAC GGG AGT TTG AAA ATT GAT GGC GAT GAT 1427
Val Val Gly Asn Tyr Leu Asp Gly Ser Leu Lys Ile Asp Gly Asp Asp
400 405 410 415

AGA GAT GAT TTG AAT GAT GCG ATG AAT AAC CCT ATG CAA CAA CCC GTG 1475
Arg Asp Asp Leu Asn Asp Ala Met Asn Asn Pro Met Gln Gln Pro Val
420 425 430

CAA CAA ACG CCT ACT AGC AAC ATG GCC GAC ACC CAT GCA AAT GAC AGC 1523
Gln Gln Thr Pro Thr Ser Asn Met Ala Asp Thr His Ala Asn Asp Ser
435 440 445

AAG GAT CAA GGG AGT AAC GCG CTC ATA AAC CCT AAC AGC GCC ACT AAC 1571
Lys Asp Gln Gly Ser Asn Ala Leu Ile Asn Pro Asn Ser Ala Thr Asn
450 455 460

GCC GAC GAC ACT CAC ACT GAC GAT ACT CAC ACT GAC ACT AAC ACC ACA 1619
Ala Asp Asp Thr His Thr Asp Asp Thr His Thr Asp Thr Asn Thr Thr
465 470 475

AAC GAT GCT AGC ACC ACT GAC ACC CCC ACT GAC GAT AAA GAT GCT AGC 1667
Asn Asp Ala Ser Thr Thr Asp Thr Pro Thr Asp Asp Lys Asp Ala Ser
480 485 490 495

GGC TTG AAC AAT ACC GGC GAT ATG AAT AAC ACG GAT ACC GGC AAC ACG 1715
Gly Leu Asn Asn Thr Gly Asp Met Asn Asn Thr Asp Thr Gly Asn Thr
500 505 510

GAC ACC GGC AAT ACG GAT ACC GGT AAC ACT GAT GAT ATG AGC AAC ATG 1763
Asp Thr Gly Asn Thr Asp Thr Gly Asn Thr Asp Asp Met Ser Asn Met
515 520 525

AAC AAC GGC AAC GAT GAT ACG GGT AAC GCT AAT GAC GAC ATG AGC AAC 1811
Asn Asn Gly Asn Asp Asp Thr Gly Asn Ala Asn Asp Asp Met Ser Asn
530 535 540

GGC AAC GAC ATG GGC GAT GAT TTG AAC AAC GCG AAC GAT ATG AAC GAC 1859
Gly Asn Asp Met Gly Asp Asp Leu Asn Asn Ala Asn Asp Met Asn Asp
545 550 555

GAC ATG GGT AAT GGC AAC GAT GAC ATG GGC GAT ATG GGG GAT ATG AAC 1907
Asp Met Gly Asn Gly Asn Asp Asp Met Gly Asp Met Gly Asp Met Asn
560 565 570 575

GAC GAT ATG GGT GGC GAT ATG GGA GAC ATG GGG GAT ATG GGC GAT ATG 1955
Asp Asp Met Gly Gly Asp Met Gly Asp Met Gly Asp Met Gly Asp Met
580 585 590

GGG AAT TGAGATTAAC CCCAATATCA AAGAGTGATA GCCAAAACTT TAAGGAATAT TT 2013
Gly Asn

TTATAGTAAA AACGATTCTT TTAAGGTAAT AGGGGGGATA TTTTGC 2059

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Met Lys Ala Ile Lys Ile Leu Phe Ile Met Thr Leu Ser Leu Asn Ala
1 5 10 15

Ile Ser Val Asn Arg Ala Leu Phe Asp Leu Lys Asp Ser Gln Leu Lys
20 25 30

Gly Glu Leu Thr Pro Lys Ile Val Asn Phe Gly Gly Tyr Lys Ser Ser
35 40 45

Thr Glu Glu Trp Gly Ala Thr Ala Leu Asn Tyr Ile Asn Ala Ala Asn
50 55 60

Gly Asp Ala Lys Lys Phe Ser Thr Leu Val Glu Lys Met Arg Phe Asn
65 70 75 80

Ser Gly Ile Leu Gly Asn Leu Arg Val His Ala Arg Leu Arg Gln Ala
85 90 95

Leu Lys Leu Gln Lys Asn Leu Lys Tyr Cys Leu Lys Ile Ile Ala Arg
100 105 110

Asp Ser Phe Tyr Ser Tyr Arg Thr Gly Ile Tyr Ile Pro Leu Gly Ile
115 120 125

Ser Leu Lys Asp Gln Lys Thr Ala Gln Lys Met Leu Ala Asp Leu Ser
130 135 140

Val Val Gly Ala Tyr Leu Lys Lys Gln Gln Glu Asn Glu Lys Ala Gln
145 150 155 160

Ser Pro Tyr Tyr Arg Asn Asn Asn Tyr Tyr Asn Ser Tyr Tyr Ser Pro
165 170 175

Tyr Tyr Gly Met Tyr Gly Met Tyr Gly Met Gly Met Tyr Gly Met Tyr
180 185 190

Gly Met Gly Met Tyr Asp Phe Tyr Asp Phe Tyr Asp Gly Met Tyr Gly
195 200 205

Phe Tyr Pro Asn Met Phe Phe Met Met Gln Val Gln Asp Tyr Leu Met
210 215 220

Leu Glu Asn Tyr Met Tyr Ala Leu Asp Gln Glu Glu Ile Leu Asp His
225 230 235 240

Asp Ala Ser Thr Asp Gln Leu Asp Thr Pro Thr Asp Asp Asp Lys Asp
245 250 255

Asp Lys Asp Asp Lys Ser Leu Gln Gln Ala Asn Leu Met Asn Phe Tyr
260 265 270

Arg Asp Pro Lys Phe Ser Lys Gly Ile Gln Thr Asn Arg Leu Asn Ser
275 280 285

Ala Leu Val Asn Leu Asp Asn Ser Arg Met Leu Lys Asp Asn Ser Leu
290 295 300

Phe His Thr Lys Ala Met Pro Thr Lys Ser Val Asp Ala Ile Thr Ser
305 310 315 320

Gln Ala Lys Glu Leu Asn His Leu Val Gly Gln Ile Lys Glu Met Lys
325 330 335

Gln Asp Gly Ala Ser Pro Ser Lys Ile Asp Ser Val Val Asn Lys Ala
340 345 350

Met Glu Val Arg Asp Lys Leu Asp Asn Asn Leu Asn Gln Leu Asp Asn
355 360 365

Asp Leu Lys Asp Gln Lys Gly Leu Ser Ser Glu Gln Gln Ala Gln Val
370 375 380

Asp Lys Ala Leu Asp Ser Val Gln Gln Leu Ser His Ser Ser Asp Val
385 390 395 400

Val Gly Asn Tyr Leu Asp Gly Ser Leu Lys Ile Asp Gly Asp Asp Arg
405 410 415

Asp Asp Leu Asn Asp Ala Met Asn Asn Pro Met Gln Gln Pro Val Gln
420 425 430

Gln Thr Pro Thr Ser Asn Met Ala Asp Thr His Ala Asn Asp Ser Lys
435 440 445

Asp Gln Gly Ser Asn Ala Leu Ile Asn Pro Asn Ser Ala Thr Asn Ala
450 455 460

Asp Asp Thr His Thr Asp Asp Thr His Thr Asp Thr Asn Thr Thr Asn
465 470 475 480

Asp Ala Ser Thr Thr Asp Thr Pro Thr Asp Asp Lys Asp Ala Ser Gly
485 490 495

Leu Asn Asn Thr Gly Asp Met Asn Asn Thr Asp Thr Gly Asn Thr Asp
500 505 510

Thr Gly Asn Thr Asp Thr Gly Asn Thr Asp Asp Met Ser Asn Met Asn
515 520 525

Asn Gly Asn Asp Asp Thr Gly Asn Ala Asn Asp Asp Met Ser Asn Gly
530 535 540

Asn Asp Met Gly Asp Asp Leu Asn Asn Ala Asn Asp Met Asn Asp Asp
545 550 555 560

Met Gly Asn Gly Asn Asp Asp Met Gly Asp Met Gly Asp Met Asn Asp
565 570 575

Asp Met Gly Gly Asp Met Gly Asp Met Gly Asp Met Gly Asp Met Gly
580 585 590

Asn

PCT/US97/21353



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 112...1461
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

AATGAGCGAT TTGAAAGATT TTGTCAATAA AACTTCAAGC CCTTTAAATG CGAATTGATT 60

TTCTTATATT ATGATTACGA TTTATCAATT TAAAACATTT GGAGAAAGAC A ATG AGT 117
Met Ser
1

ATG GAA TTT GAT GCT GTT ATT ATT GGA GGT GGG GTT TCA GGG TGC GCG 165
Met Glu Phe Asp Ala Val Ile Ile Gly Gly Gly Val Ser Gly Cys Ala
5 10 15

ACC TTT TAT ACT TTG AGC GAA TAC AGC TCT TTA AAG CGC GTG GCT ATC 213
Thr Phe Tyr Thr Leu Ser Glu Tyr Ser Ser Leu Lys Arg Val Ala Ile
20 25 30

GTG GAA AAA TGC TCT AAA TTG GCT CAA ATC AGC TCC AGC GCT AAA GCT 261
Val Glu Lys Cys Ser Lys Leu Ala Gln Ile Ser Ser Ser Ala Lys Ala
35 40 45 50

AAT TCG CAA ACC ATT CAT GAT GGC TCT ATT GAA ACG AAT TAC ACT CCC 309
Asn Ser Gln Thr Ile His Asp Gly Ser Ile Glu Thr Asn Tyr Thr Pro
55 60 65

GAA AAA GCT AAA AAA GTG CGT TTG AGC GCT TAT AAG ACC AGG CAA TAC 357
Glu Lys Ala Lys Lys Val Arg Leu Ser Ala Tyr Lys Thr Arg Gln Tyr
70 75 80

GCT TTG AAT AAA GGC TTG CAA AAT GAA GTG ATT TTT GAA ACC CAG AAA 405
Ala Leu Asn Lys Gly Leu Gln Asn Glu Val Ile Phe Glu Thr Gln Lys
85 90 95

ATG GCT ATA GGC GTG GGC GAT GAA GAA TGC GAG TTC ATG AAA AAA CGC 453
Met Ala Ile Gly Val Gly Asp Glu Glu Cys Glu Phe Met Lys Lys Arg
100 105 110

TAC GAA TCT TTT AAA GAA ATC TTT GTG GGG TTA GAA GAA TTT GAC AAG 501
Tyr Glu Ser Phe Lys Glu Ile Phe Val Gly Leu Glu Glu Phe Asp Lys
115 120 125 130

CAA AAG ATT AAA GAA TTA GAG CCT AAT GTG ATT TTA GGG GCT AAT GGC 549
Gln Lys Ile Lys Glu Leu Glu Pro Asn Val Ile Leu Gly Ala Asn Gly
135 140 145

ATA GAC AGG CAT GAA AAC ATT ATC GGG CAT GGG TAT AGA AAG GAT TGG 597
Ile Asp Arg His Glu Asn Ile Ile Gly His Gly Tyr Arg Lys Asp Trp
150 155 160

AGC ACC ATG AAT TTT GCG AAG TTG AGT GAA AAC TTC GTT GAA GAA GCC 645
Ser Thr Met Asn Phe Ala Lys Leu Ser Glu Asn Phe Val Glu Glu Ala
165 170 175

CTA AAA TTA AAG CCT AAC AAC CAG GTG TTT TTG AAT TTC AAA GTG AAA 693
Leu Lys Leu Lys Pro Asn Asn Gln Val Phe Leu Asn Phe Lys Val Lys
180 185 190

AAG ATT GAA AAA CGC AAC GAC ACT TAC GCC GTA ATT TCA GAA GAC GCT 741
Lys Ile Glu Lys Arg Asn Asp Thr Tyr Ala Val Ile Ser Glu Asp Ala
195 200 205 210

GAA GAA GTG TAT GCT AAA TTC GTG CTG GTC AAT GCC GGC TCT TAC GCT 789
Glu Glu Val Tyr Ala Lys Phe Val Leu Val Asn Ala Gly Ser Tyr Ala
215 220 225

TTG CCT TTG GCT CAG AGC ATG GGC TAT GGC CTA GAT TTA GGG TGC TTG 837
Leu Pro Leu Ala Gln Ser Met Gly Tyr Gly Leu Asp Leu Gly Cys Leu
230 235 240

CCT GTG GCG GGC AGC TTT TAT TTT GTG CCG GAT TTA TTA AGG GGT AAG 885
Pro Val Ala Gly Ser Phe Tyr Phe Val Pro Asp Leu Leu Arg Gly Lys
245 250 255

GTT TAT ACC GTT CAA AAC CCC AAA CTC CCT TTT GCA GCC GTG CAT GGC 933
Val Tyr Thr Val Gln Asn Pro Lys Leu Pro Phe Ala Ala Val His Gly
260 265 270

GAC CCT GAT GCC GTC ATT AAA GGA AAA ACA CGA ATC GGG CCT ACC GCT 981
Asp Pro Asp Ala Val Ile Lys Gly Lys Thr Arg Ile Gly Pro Thr Ala
275 280 285 290

TTA ACG ATG CCT AAA TTA GAA CGC AAC AAA TGT TGG CTT AAG GGC ATT 1029
Leu Thr Met Pro Lys Leu Glu Arg Asn Lys Cys Trp Leu Lys Gly Ile
295 300 305

AGC TTG GAA TTG TTG AAA ATG GAT TTG AAT AAA GAT GTG TTT AAA ATT 1077
Ser Leu Glu Leu Leu Lys Met Asp Leu Asn Lys Asp Val Phe Lys Ile
310 315 320

GCG TTT GAT TTG ATG AGC GAT AAA GAA ATC CGA AAT TAT GTG TTT AAA 1125
Ala Phe Asp Leu Met Ser Asp Lys Glu Ile Arg Asn Tyr Val Phe Lys
325 330 335

AAC ATG GTT TTT GAA TTG CCC ATT ATC GGT AAA AGG AAA TTT TTA AAA 1173
Asn Met Val Phe Glu Leu Pro Ile Ile Gly Lys Arg Lys Phe Leu Lys
340 345 350

GAC GCT CAA AAA ATC ATC CCC TCT CTT AGC CTA GAA GAT CTA GAA TAC 1221
Asp Ala Gln Lys Ile Ile Pro Ser Leu Ser Leu Glu Asp Leu Glu Tyr
355 360 365 370

GCT CAT GGT TTT GGT GAA GTG CGC CCG CAA GTT TTA GAC AGA ACC AAG 1269
Ala His Gly Phe Gly Glu Val Arg Pro Gln Val Leu Asp Arg Thr Lys
375 380 385

CGA AAA CTG GAA TTA GGC GAA AAA AAG ATT TGC ACC CAT AAA GGC ATC 1317
Arg Lys Leu Glu Leu Gly Glu Lys Lys Ile Cys Thr His Lys Gly Ile
390 395 400

ACT TTT AAC ATG ACC CCT TCT CCA GGC GCG ACG AGT TGT TTG CAA AAC 1365
Thr Phe Asn Met Thr Pro Ser Pro Gly Ala Thr Ser Cys Leu Gln Asn
405 410 415

GCC CTT GTG GAT TCC CAA GAA ATC GCT GCG TAT TTG GGC GAG AGC TTT 1413
Ala Leu Val Asp Ser Gln Glu Ile Ala Ala Tyr Leu Gly Glu Ser Phe
420 425 430

GAA TTA GAA CGC TTT TAT AAA GAT TTA TCC CCA GAA GAA TTG GAA AAT T 1462
Glu Leu Glu Arg Phe Tyr Lys Asp Leu Ser Pro Glu Glu Leu Glu Asn
435 440 445 450

AAAAACGCAT GCAAAAAGAA CAAGAAGCCC AAGAAATCGC TAAAAAAGCC GTTAAAATCG 1522
TGTTT 1527

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

Met Ser Met Glu Phe Asp Ala Val Ile Ile Gly Gly Gly Val Ser Gly
1 5 10 15

Cys Ala Thr Phe Tyr Thr Leu Ser Glu Tyr Ser Ser Leu Lys Arg Val
20 25 30

Ala Ile Val Glu Lys Cys Ser Lys Leu Ala Gln Ile Ser Ser Ser Ala
35 40 45

Lys Ala Asn Ser Gln Thr Ile His Asp Gly Ser Ile Glu Thr Asn Tyr
50 55 60

Thr Pro Glu Lys Ala Lys Lys Val Arg Leu Ser Ala Tyr Lys Thr Arg
65 70 75 80

Gln Tyr Ala Leu Asn Lys Gly Leu Gln Asn Glu Val Ile Phe Glu Thr
85 90 95

Gln Lys Met Ala Ile Gly Val Gly Asp Glu Glu Cys Glu Phe Met Lys
100 105 110

Lys Arg Tyr Glu Ser Phe Lys Glu Ile Phe Val Gly Leu Glu Glu Phe
115 120 125

Asp Lys Gln Lys Ile Lys Glu Leu Glu Pro Asn Val Ile Leu Gly Ala
130 135 140

Asn Gly Ile Asp Arg His Glu Asn Ile Ile Gly His Gly Tyr Arg Lys
145 150 155 160

Asp Trp Ser Thr Met Asn Phe Ala Lys Leu Ser Glu Asn Phe Val Glu
165 170 175

Glu Ala Leu Lys Leu Lys Pro Asn Asn Gln Val Phe Leu Asn Phe Lys
180 185 190

Val Lys Lys Ile Glu Lys Arg Asn Asp Thr Tyr Ala Val Ile Ser Glu
195 200 205

Asp Ala Glu Glu Val Tyr Ala Lys Phe Val Leu Val Asn Ala Gly Ser
210 215 220

Tyr Ala Leu Pro Leu Ala Gln Ser Met Gly Tyr Gly Leu Asp Leu Gly
225 230 235 240

Cys Leu Pro Val Ala Gly Ser Phe Tyr Phe Val Pro Asp Leu Leu Arg
245 250 255

Gly Lys Val Tyr Thr Val Gln Asn Pro Lys Leu Pro Phe Ala Ala Val
260 265 270

His Gly Asp Pro Asp Ala Val Ile Lys Gly Lys Thr Arg Ile Gly Pro
275 280 285

Thr Ala Leu Thr Met Pro Lys Leu Glu Arg Asn Lys Cys Trp Leu Lys
290 295 300

Gly Ile Ser Leu Glu Leu Leu Lys Met Asp Leu Asn Lys Asp Val Phe
305 310 315 320

Lys Ile Ala Phe Asp Leu Met Ser Asp Lys Glu Ile Arg Asn Tyr Val
325 330 335

Phe Lys Asn Met Val Phe Glu Leu Pro Ile Ile Gly Lys Arg Lys Phe
340 345 350

Leu Lys Asp Ala Gln Lys Ile Ile Pro Ser Leu Ser Leu Glu Asp Leu
355 360 365

Glu Tyr Ala His Gly Phe Gly Glu Val Arg Pro Gln Val Leu Asp Arg
370 375 380

Thr Lys Arg Lys Leu Glu Leu Gly Glu Lys Lys Ile Cys Thr His Lys
385 390 395 400

Gly Ile Thr Phe Asn Met Thr Pro Ser Pro Gly Ala Thr Ser Cys Leu
405 410 415

Gln Asn Ala Leu Val Asp Ser Gln Glu Ile Ala Ala Tyr Leu Gly Glu
420 425 430

Ser Phe Glu Leu Glu Arg Phe Tyr Lys Asp Leu Ser Pro Glu Glu Leu
435 440 445

Glu Asn 
450 

(2) INFORMATION FOR SEQ ID NO: 69:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 653 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 63...590
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

CTAGATTTAA TTTTAAAGTT ATATAATTAA ACCACAAAAT CCTTTTTTAA AAGAAACTAA 60

GC ATG CCA AAA CCC AAG AAA AAC ACC CTC CCC TGT AGC CTT TCT GTC 107
Met Pro Lys Pro Lys Lys Asn Thr Leu Pro Cys Ser Leu Ser Val
1 5 10 15

AAA ATG TCT TAT TTC ATG CGC TTT CTC ATT AAA TGG CGC ACC CGC TCT 155
Lys Met Ser Tyr Phe Met Arg Phe Leu Ile Lys Trp Arg Thr Arg Ser
20 25 30

TTA AGC CAT AAA ATG ATG ACT CTC ATT CAA ATC TTA AGC ATT CTG GCT 203
Leu Ser His Lys Met Met Thr Leu Ile Gln Ile Leu Ser Ile Leu Ala
35 40 45

TTA GCG AGC AAG GCC AGT GAA GAT TTA GAA GAG CAA CTC AAA AAA ATC 251
Leu Ala Ser Lys Ala Ser Glu Asp Leu Glu Glu Gln Leu Lys Lys Ile
50 55 60

AAA GAT TAC ATT TAT AGA ACC CTA AAC GCT AAA ATC GCA TCG GAT GTG 299
Lys Asp Tyr Ile Tyr Arg Thr Leu Asn Ala Lys Ile Ala Ser Asp Val
65 70 75

TAT AAC CGA GTG CTT ATT TTA GTG AAT GAA TAT TGC ACT AAT GAA GAA 347
Tyr Asn Arg Val Leu Ile Leu Val Asn Glu Tyr Cys Thr Asn Glu Glu
80 85 90 95

TTG TTT GAC AAA GAG AGC GTT AAA ATT TCA GAT TTA CTC ATT CAA GAC 395
Leu Phe Asp Lys Glu Ser Val Lys Ile Ser Asp Leu Leu Ile Gln Asp
100 105 110

ATT CAG CTT TAC GCT TTA GTG GAT GAA ATG CTT AAA GAA GAT AAA TAT 443
Ile Gln Leu Tyr Ala Leu Val Asp Glu Met Leu Lys Glu Asp Lys Tyr
115 120 125

CAA GTC CAG CAC ACC ATT TTA AAG GGC ATC ATC AAA CGC AAA TAC GAT 491
Gln Val Gln His Thr Ile Leu Lys Gly Ile Ile Lys Arg Lys Tyr Asp
130 135 140

GAA GCC TAC TCG CTC AAT AGC GAA GAC AGG ATT CTT TTA GAA TAC CAA 539
Glu Ala Tyr Ser Leu Asn Ser Glu Asp Arg Ile Leu Leu Glu Tyr Gln
145 150 155

GAA CGC TTG CTA GAA CAC TCA CAC GCG TCT TTT TCA AAT AAA AAA TTC 587
Glu Arg Leu Leu Glu His Ser His Ala Ser Phe Ser Asn Lys Lys Phe
160 165 170 175

AAA TGATTTGAAA GCGTTACTTG CCCTGCTTTT TGGGCTTTTA TTGAAAAAGG GCTTTA 646
Lys

AAATGAG 653
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Met Pro Lys Pro Lys Lys Asn Thr Leu Pro Cys Ser Leu Ser Val Lys
1 5 10 15

Met Ser Tyr Phe Met Arg Phe Leu Ile Lys Trp Arg Thr Arg Ser Leu
20 25 30

Ser His Lys Met Met Thr Leu Ile Gln Ile Leu Ser Ile Leu Ala Leu
35 40 45

Ala Ser Lys Ala Ser Glu Asp Leu Glu Glu Gln Leu Lys Lys Ile Lys
50 55 60

Asp Tyr Ile Tyr Arg Thr Leu Asn Ala Lys Ile Ala Ser Asp Val Tyr
65 70 75 80

Asn Arg Val Leu Ile Leu Val Asn Glu Tyr Cys Thr Asn Glu Glu Leu
85 90 95

Phe Asp Lys Glu Ser Val Lys Ile Ser Asp Leu Leu Ile Gln Asp Ile
100 105 110

Gln Leu Tyr Ala Leu Val Asp Glu Met Leu Lys Glu Asp Lys Tyr Gln
115 120 125

Val Gln His Thr Ile Leu Lys Gly Ile Ile Lys Arg Lys Tyr Asp Glu
130 135 140

Ala Tyr Ser Leu Asn Ser Glu Asp Arg Ile Leu Leu Glu Tyr Gln Glu
145 150 155 160

Arg Leu Leu Glu His Ser His Ala Ser Phe Ser Asn Lys Lys Phe Lys
165 170 175

(2) INFORMATION FOR SEQ ID NO: 71:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1883 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 91...1833
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

AAGCGTTAAA TTCCAATCAA AAACCATCGT ATCGGTGTTA ATATTGTGTA AAAATTAATG 60

TTATGAATCT CTTGTATTAA AAGGACTTCA ATG AAA AAA TTG GTT TTA GTC ATC 114
Met Lys Lys Leu Val Leu Val Ile
1 5

TTT TTA ACG CTA GCG CTT TCA ATA TCT GCA AAA GAA GTC AAA ATA GTG 162
Phe Leu Thr Leu Ala Leu Ser Ile Ser Ala Lys Glu Val Lys Ile Val
10 15 20

TTT TTA GAA ACT TCA GAC ATT CAT GGG CGG CTT TTT TCG TAT GAT TAT 210
Phe Leu Glu Thr Ser Asp Ile His Gly Arg Leu Phe Ser Tyr Asp Tyr
25 30 35 40

GCG ATT GGC GAG CAA AAA CCC AAT AAC GGC TTG ACA AGG ATT GCG ACT 258
Ala Ile Gly Glu Gln Lys Pro Asn Asn Gly Leu Thr Arg Ile Ala Thr
45 50 55

TTA ATC AAA AAG CAA AGG GCT GAG AAT AAA AAT GTG GTT TTG ATT GAC 306
Leu Ile Lys Lys Gln Arg Ala Glu Asn Lys Asn Val Val Leu Ile Asp
60 65 70

AGC GGG GAT TTG TTG CAA GGC AAT AGC GCG GAG TTG TTT AAT GAT GAG 354
Ser Gly Asp Leu Leu Gln Gly Asn Ser Ala Glu Leu Phe Asn Asp Glu
75 80 85

CCA ATT CAT CCG CTA GTT AGA GCT GAA AAC GAT TTG AAA TTT GAC ATT 402
Pro Ile His Pro Leu Val Arg Ala Glu Asn Asp Leu Lys Phe Asp Ile
90 95 100

CGT GTG CTT GGC AAT CAC GAG TTT AAT TTC AGT AAA GAT TTT TTA GAA 450
Arg Val Leu Gly Asn His Glu Phe Asn Phe Ser Lys Asp Phe Leu Glu
105 110 115 120

AAG AAT ATT AAG GGG TTT AAT GGC GAT GTC ATG AAT GCG AAT ATC ATT 498
Lys Asn Ile Lys Gly Phe Asn Gly Asp Val Met Asn Ala Asn Ile Ile
125 130 135

AAA ATT GCG GAC AAT AAG CCG TTT GTA AAA CCT TAT ATT ATT AAA AAA 546
Lys Ile Ala Asp Asn Lys Pro Phe Val Lys Pro Tyr Ile Ile Lys Lys
140 145 150

ATT GAT GGC GTG AGG GTG GCG GTT GTG GGG TAT GTG GTG GCG CAC ATC 594
Ile Asp Gly Val Arg Val Ala Val Val Gly Tyr Val Val Ala His Ile
155 160 165

CCA ACT TGG GAG GCC TCT ACG CCT GAA CAT TTT GCA GGA TTG AAG TTT 642
Pro Thr Trp Glu Ala Ser Thr Pro Glu His Phe Ala Gly Leu Lys Phe
170 175 180

TTG GAC GCT GAA GAA GCG TTA AAA AAG ACC TTA AAA GAG TTG AAA GGG 690
Leu Asp Ala Glu Glu Ala Leu Lys Lys Thr Leu Lys Glu Leu Lys Gly
185 190 195 200

AAG TAT GAT ATT TTG ATT GGC GCT TTT CAT TTG GGG CGA GAA GAT GAG 738
Lys Tyr Asp Ile Leu Ile Gly Ala Phe His Leu Gly Arg Glu Asp Glu
205 210 215

AAA GGT GGC GAC GGG ATA CCG GAT TTA GCG AAA AAA TTC CCG CAA TTT 786
Lys Gly Gly Asp Gly Ile Pro Asp Leu Ala Lys Lys Phe Pro Gln Phe
220 225 230

GAC ATC ATT TTT GCA GGG CAT GAG CAT GCG GTT TAT AAC ACC AAA GTA 834
Asp Ile Ile Phe Ala Gly His Glu His Ala Val Tyr Asn Thr Lys Val
235 240 245

GGG AAA GTG CAT ACC ATT GAG CCT GGA GCG TAT GGG GCT TAT CTG GCA 882
Gly Lys Val His Thr Ile Glu Pro Gly Ala Tyr Gly Ala Tyr Leu Ala
250 255 260

AAG GGC GTG GTG GTA TTT GAC ACT AAA ACG AAG AAA AAA ATT ATA ACG 930
Lys Gly Val Val Val Phe Asp Thr Lys Thr Lys Lys Lys Ile Ile Thr
265 270 275 280

ACT GAA AAT TTA CCC ACA AAA GAT GTG CCA GAA GAT GAA GAA TTA GCG 978
Thr Glu Asn Leu Pro Thr Lys Asp Val Pro Glu Asp Glu Glu Leu Ala
285 290 295

AAA AAA TAC GAA TAT GTG GAT AAA AAA TCA AAA GAA TAC GCT AAT GAA 1026
Lys Lys Tyr Glu Tyr Val Asp Lys Lys Ser Lys Glu Tyr Ala Asn Glu
300 305 310

GTG GTT GGC GAA GTT ACA AAA ACC TTT ATT GAC AGG CCT GAT TTT ATC 1074
Val Val Gly Glu Val Thr Lys Thr Phe Ile Asp Arg Pro Asp Phe Ile
315 320 325

ACA GGA GAA GAA AAA ATC ACC ACG ATG CCC ACC GCC GCC TTG CAA GAA 1122
Thr Gly Glu Glu Lys Ile Thr Thr Met Pro Thr Ala Ala Leu Gln Glu
330 335 340

ACA CCG GTG ATA GAA TTG ATT AAT AAA GTG CAA AAA TAT TAC GCA AAA 1170
Thr Pro Val Ile Glu Leu Ile Asn Lys Val Gln Lys Tyr Tyr Ala Lys
345 350 355 360

GCC GAT GTT TCA GCG GCA GCC TTA TTC AAT TTT GGG GCG AAT TTG AAA 1218
Ala Asp Val Ser Ala Ala Ala Leu Phe Asn Phe Gly Ala Asn Leu Lys
365 370 375

AAA GGG CCT TTC AAA AGA AAA GAT GTC ACT TAT ATT TAC AAG TTC GCT 1266
Lys Gly Pro Phe Lys Arg Lys Asp Val Thr Tyr Ile Tyr Lys Phe Ala
380 385 390

AAT ACG CTC ATT GGA GTG CGT ATA ACG GGT GAA AAT CTG TTG AAA TAC 1314
Asn Thr Leu Ile Gly Val Arg Ile Thr Gly Glu Asn Leu Leu Lys Tyr
395 400 405

ATG GAA TGG TCA TAC CGA TTT TAC AAT CAG TTG CAA CCA GGA GAT TTG 1362
Met Glu Trp Ser Tyr Arg Phe Tyr Asn Gln Leu Gln Pro Gly Asp Leu
410 415 420

ACG ATC AGT TTT AAT GAA AAC ATT CGC GGC TAT AAC TTT GAT ATG TTT 1410
Thr Ile Ser Phe Asn Glu Asn Ile Arg Gly Tyr Asn Phe Asp Met Phe
425 430 435 440

TCT GGC GTG AAA TAC CAG GTT GAT GTT ACA AAA CCC GCC GGA CAA AGG 1458
Ser Gly Val Lys Tyr Gln Val Asp Val Thr Lys Pro Ala Gly Gln Arg
445 450 455

ATT ATC AAT CCG ACA ATC AAC AAC AAA CCC ATT GAC CCC AAA GCC ATC 1506
Ile Ile Asn Pro Thr Ile Asn Asn Lys Pro Ile Asp Pro Lys Ala Ile
460 465 470

TAT AAA TTA GCG ATC AAC AAT TAC CGA TTC GGA ACA TTA TCC ACG ACA 1554
Tyr Lys Leu Ala Ile Asn Asn Tyr Arg Phe Gly Thr Leu Ser Thr Thr
475 480 485

TTG AAT TTG GTT ACA GAC GCT GMT AGG TAT TAT AAT TCT TAC GAT GAA 1602
Leu Asn Leu Val Thr Asp Ala Xaa Arg Tyr Tyr Asn Ser Tyr Asp Glu
490 495 500

CTG CAA GAT AAT GGG CAA ATA CGA GAT TTG ATC ATC AAA TAC ATC ACG 1650
Leu Gln Asp Asn Gly Gln Ile Arg Asp Leu Ile Ile Lys Tyr Ile Thr
505 510 515 520

GAA GAA AAA GGT GGG AAG GTA ACC CCT GAA TTG GAG GGT AAT TGG GAA 1698
Glu Glu Lys Gly Gly Lys Val Thr Pro Glu Leu Glu Gly Asn Trp Glu
525 530 535

ATC ATC AAC TAC GAT TTC AAA AAC CCG TTG TTG GAA AAA TTG AGA GAA 1746
Ile Ile Asn Tyr Asp Phe Lys Asn Pro Leu Leu Glu Lys Leu Arg Glu
540 545 550

AAA TTA AAA GAG GGG AGC ATC AAA ATC CCC ACC TCA AAG GAT GGG AGG 1794
Lys Leu Lys Glu Gly Ser Ile Lys Ile Pro Thr Ser Lys Asp Gly Arg
555 560 565

ACT TTG AAT GTC AAA TCC ATT AAA GAG AGT GAA GTT AAA TAAAATT 1840
Thr Leu Asn Val Lys Ser Ile Lys Glu Ser Glu Val Lys
570 575 580

TTTTATTTTT ATTATTTTAT CTTTAAGCCT AACTTAAAAA AGG 1883



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 581 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Met Lys Lys Leu Val Leu Val Ile Phe Leu Thr Leu Ala Leu Ser Ile
1 5 10 15

Ser Ala Lys Glu Val Lys Ile Val Phe Leu Glu Thr Ser Asp Ile His
20 25 30

Gly Arg Leu Phe Ser Tyr Asp Tyr Ala Ile Gly Glu Gln Lys Pro Asn
35 40 45

Asn Gly Leu Thr Arg Ile Ala Thr Leu Ile Lys Lys Gln Arg Ala Glu
50 55 60

Asn Lys Asn Val Val Leu Ile Asp Ser Gly Asp Leu Leu Gln Gly Asn
65 70 75 80

Ser Ala Glu Leu Phe Asn Asp Glu Pro Ile His Pro Leu Val Arg Ala
85 90 95

Glu Asn Asp Leu Lys Phe Asp Ile Arg Val Leu Gly Asn His Glu Phe
100 105 110

Asn Phe Ser Lys Asp Phe Leu Glu Lys Asn Ile Lys Gly Phe Asn Gly
115 120 125

Asp Val Met Asn Ala Asn Ile Ile Lys Ile Ala Asp Asn Lys Pro Phe
130 135 140

Val Lys Pro Tyr Ile Ile Lys Lys Ile Asp Gly Val Arg Val Ala Val
145 150 155 160

Val Gly Tyr Val Val Ala His Ile Pro Thr Trp Glu Ala Ser Thr Pro
165 170 175

Glu His Phe Ala Gly Leu Lys Phe Leu Asp Ala Glu Glu Ala Leu Lys
180 185 190

Lys Thr Leu Lys Glu Leu Lys Gly Lys Tyr Asp Ile Leu Ile Gly Ala
195 200 205

Phe His Leu Gly Arg Glu Asp Glu Lys Gly Gly Asp Gly Ile Pro Asp
210 215 220

Leu Ala Lys Lys Phe Pro Gln Phe Asp Ile Ile Phe Ala Gly His Glu
225 230 235 240

His Ala Val Tyr Asn Thr Lys Val Gly Lys Val His Thr Ile Glu Pro
245 250 255

Gly Ala Tyr Gly Ala Tyr Leu Ala Lys Gly Val Val Val Phe Asp Thr
260 265 270

Lys Thr Lys Lys Lys Ile Ile Thr Thr Glu Asn Leu Pro Thr Lys Asp
275 280 285

Val Pro Glu Asp Glu Glu Leu Ala Lys Lys Tyr Glu Tyr Val Asp Lys
290 295 300

Lys Ser Lys Glu Tyr Ala Asn Glu Val Val Gly Glu Val Thr Lys Thr
305 310 315 320

Phe Ile Asp Arg Pro Asp Phe Ile Thr Gly Glu Glu Lys Ile Thr Thr
325 330 335

Met Pro Thr Ala Ala Leu Gln Glu Thr Pro Val Ile Glu Leu Ile Asn
340 345 350

Lys Val Gln Lys Tyr Tyr Ala Lys Ala Asp Val Ser Ala Ala Ala Leu
355 360 365

Phe Asn Phe Gly Ala Asn Leu Lys Lys Gly Pro Phe Lys Arg Lys Asp
370 375 380

Val Thr Tyr Ile Tyr Lys Phe Ala Asn Thr Leu Ile Gly Val Arg Ile
385 390 395 400

Thr Gly Glu Asn Leu Leu Lys Tyr Met Glu Trp Ser Tyr Arg Phe Tyr
405 410 415

Asn Gln Leu Gln Pro Gly Asp Leu Thr Ile Ser Phe Asn Glu Asn Ile
420 425 430

Arg Gly Tyr Asn Phe Asp Met Phe Ser Gly Val Lys Tyr Gln Val Asp
435 440 445

Val Thr Lys Pro Ala Gly Gln Arg Ile Ile Asn Pro Thr Ile Asn Asn
450 455 460

Lys Pro Ile Asp Pro Lys Ala Ile Tyr Lys Leu Ala Ile Asn Asn Tyr
465 470 475 480

Arg Phe Gly Thr Leu Ser Thr Thr Leu Asn Leu Val Thr Asp Ala Xaa
485 490 495

Arg Tyr Tyr Asn Ser Tyr Asp Glu Leu Gln Asp Asn Gly Gln Ile Arg
500 505 510

Asp Leu Ile Ile Lys Tyr Ile Thr Glu Glu Lys Gly Gly Lys Val Thr
515 520 525

Pro Glu Leu Glu Gly Asn Trp Glu Ile Ile Asn Tyr Asp Phe Lys Asn
530 535 540

Pro Leu Leu Glu Lys Leu Arg Glu Lys Leu Lys Glu Gly Ser Ile Lys
545 550 555 560

Ile Pro Thr Ser Lys Asp Gly Arg Thr Leu Asn Val Lys Ser Ile Lys
565 570 575

Glu Ser Glu Val Lys
580

(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 68...1252
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CCAATCGTTT AATAGCGATT AAATATGACT ATATACACTA CAACAATAAG ATTTTGAAAG 60

GTTGGTA ATG GAA TCA GTA AAA ACA GGA AAA ACA AAT AAG GTT GGC AAG 109
Met Glu Ser Val Lys Thr Gly Lys Thr Asn Lys Val Gly Lys
1 5 10

AAT ACA GAG ATG GCT AAT ACA AAG GCA AAT AAA GAG GCT CAT TTT AAA 157
Asn Thr Glu Met Ala Asn Thr Lys Ala Asn Lys Glu Ala His Phe Lys
15 20 25 30

CAA GCG AGC ACC ATT ACA AAT ATA ATC AGA TCA ATT CGT GGG ATT TTT 205
Gln Ala Ser Thr Ile Thr Asn Ile Ile Arg Ser Ile Arg Gly Ile Phe
35 40 45

ACA AAA ATT GCA AAG AAA GTT AGA GGA CTT GTA AAA AAA CAC CCC AAG 253
Thr Lys Ile Ala Lys Lys Val Arg Gly Leu Val Lys Lys His Pro Lys
50 55 60

AAA AGC AGT GCG GCA TTA GTA GTA TTG ACC CAT ATT GCG TGC AAG AAA 301
Lys Ser Ser Ala Ala Leu Val Val Leu Thr His Ile Ala Cys Lys Lys
65 70 75

GCG AAA GAA TTA GAC GAT AAA GTC CAA GAT AAA TCC AAA CAA GCT GAA 349
Ala Lys Glu Leu Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu
80 85 90

AAA GAA AAT CAA ATC AAT TGG TGG AAA TAT TCA GGA TTA ACA ATA GCG 397
Lys Glu Asn Gln Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala
95 100 105 110

ACA AGT TTA TTA TTA GCC GCT TGT AGC ACT GGT GAT GTT AGT GAA CAA 445
Thr Ser Leu Leu Leu Ala Ala Cys Ser Thr Gly Asp Val Ser Glu Gln
115 120 125

ATA GAA CTA GAA CAA GAA AAA CAA AAG ACG AGC AAT ATA GAG ACT AAC 493
Ile Glu Leu Glu Gln Glu Lys Gln Lys Thr Ser Asn Ile Glu Thr Asn
130 135 140

AAT CAA ATA AAA GTA GAA CAA GAA AAA CAA AAG ACA AGC AAT ATA GAG 541
Asn Gln Ile Lys Val Glu Gln Glu Lys Gln Lys Thr Ser Asn Ile Glu
145 150 155

ACT AAT AAT CAA ATA AAA GTA GAA CAA GAA CAA CAA AAG ACA AGC AAT 589
Thr Asn Asn Gln Ile Lys Val Glu Gln Glu Gln Gln Lys Thr Ser Asn
160 165 170

ACA CAG AAA GAT TTG GTT AAA GAA CAG AAA GAT TTG GTT AAA GAA CAG 637
Thr Gln Lys Asp Leu Val Lys Glu Gln Lys Asp Leu Val Lys Glu Gln
175 180 185 190

AAA GAT TTG GTT AAA GAA CAG AAA GAT TTG GTT AAA GAA CAG AAA GAT 685
Lys Asp Leu Val Lys Glu Gln Lys Asp Leu Val Lys Glu Gln Lys Asp
195 200 205

TTG GTT AAA ACA CAG AAA GAT TTC ATT AAA TAT GTA GAA CAA AAT TGC 733
Leu Val Lys Thr Gln Lys Asp Phe Ile Lys Tyr Val Glu Gln Asn Cys
210 215 220

CAA GAA AAT CAT AAT CAA TTC TTT ATT GAA AAA GGA GGA ATT AAG GCT 781
Gln Glu Asn His Asn Gln Phe Phe Ile Glu Lys Gly Gly Ile Lys Ala
225 230 235

GGT ATT GGT ATA GAA GTA GAA GCT GAA TGC AAA ACC CCT AAA CCT GCA 829
Gly Ile Gly Ile Glu Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala
240 245 250

AAA ACC AAT CAA ACC CCT ATC CAG CCA AAA CAC CTC CCA AAC TCT AAA 877
Lys Thr Asn Gln Thr Pro Ile Gln Pro Lys His Leu Pro Asn Ser Lys
255 260 265 270

CAA CCC CGC TCT CAA AGA GGA TCA AAA GCG CAA GAG CTT ATC GCT TAT 925
Gln Pro Arg Ser Gln Arg Gly Ser Lys Ala Gln Glu Leu Ile Ala Tyr
275 280 285

TTG CAA AAA GAG CTA GAA TTT CTG CCC TAT TCG CAA AAA GCT ATC GCT 973
Leu Gln Lys Glu Leu Glu Phe Leu Pro Tyr Ser Gln Lys Ala Ile Ala
290 295 300

AAA CAA GTG GAT TTT TAC AGG CCA AGT TCT ATC GCT TAT TTA GAA CTA 1021
Lys Gln Val Asp Phe Tyr Arg Pro Ser Ser Ile Ala Tyr Leu Glu Leu
305 310 315

GAT CCT AGA GAT TTT AAG GTT ACA GAA GAA TGG CAA AAA GAA AAT CTA 1069
Asp Pro Arg Asp Phe Lys Val Thr Glu Glu Trp Gln Lys Glu Asn Leu
320 325 330

AAA ATA CGC TCT AAA GCT CAA GCT AAA ATG CTT GAA ATG AGA AAC CCA 1117 
Lys He Arg Ser Lys Ala Gin Ala Lys Met Leu Glu Met Arg Asn Pro 
335 340 345 350 

CAA GCC CAC CTT TCA AAC TCT CAA AGC CTT TTG TTC GTT CAA AAA ATA 116 5 

Gin Ala His Leu Ser Asn Ser Gin Ser Leu Leu Phe Val Gin Lys He 
355 360 365 
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TTT GCT GAT GTT AAT AAA GAA ATA GAA GCA GTT GCT AAT ACT GAA AAG 1213 

Phe Ala Asp Val Asn Lys Glu lie Glu Ala Val Ala Asn Thr Glu Lys 

370 375 380 

AAA GCA GAA AAA GCG GGT TAT GGT TAT AGT AAA AGG ATG TAG CG G TTAA AA 12 64 
Lys Ala Glu Lys Ala Gly Tyr Gly Tyr Ser Lys Arg Met 
385 390 395 

ACATTGCACC AAGTTTTTAA TTATCTGTCG GCTTTTGAAA ACATTTTTTA TGG TAGCGTT 13 2 4 
ATTTGGCAAT AAAAG 13 3 9 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 395 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

Met Glu Ser Val Lys Thr Gly Lys Thr Asn Lys Val Gly Lys Asn Thr
1 5 10 15

Glu Met Ala Asn Thr Lys Ala Asn Lys Glu Ala His Phe Lys Gln Ala
20 25 30

Ser Thr Ile Thr Asn Ile Ile Arg Ser Ile Arg Gly Ile Phe Thr Lys
35 40 45

Ile Ala Lys Lys Val Arg Gly Leu Val Lys Lys His Pro Lys Lys Ser
50 55 60

Ser Ala Ala Leu Val Val Leu Thr His Ile Ala Cys Lys Lys Ala Lys
65 70 75 80

Glu Leu Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu Lys Glu
85 90 95

Asn Gln Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala Thr Ser
100 105 110

Leu Leu Leu Ala Ala Cys Ser Thr Gly Asp Val Ser Glu Gln Ile Glu
115 120 125

Leu Glu Gln Glu Lys Gln Lys Thr Ser Asn Ile Glu Thr Asn Asn Gln
130 135 140

Ile Lys Val Glu Gln Glu Lys Gln Lys Thr Ser Asn Ile Glu Thr Asn
145 150 155 160

Asn Gln Ile Lys Val Glu Gln Glu Gln Gln Lys Thr Ser Asn Thr Gln
165 170 175

Lys Asp Leu Val Lys Glu Gln Lys Asp Leu Val Lys Glu Gln Lys Asp
180 185 190

Leu Val Lys Glu Gln Lys Asp Leu Val Lys Glu Gln Lys Asp Leu Val
195 200 205

Lys Thr Gln Lys Asp Phe Ile Lys Tyr Val Glu Gln Asn Cys Gln Glu
210 215 220

Asn His Asn Gln Phe Phe Ile Glu Lys Gly Gly Ile Lys Ala Gly Ile
225 230 235 240

Gly Ile Glu Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr
245 250 255

Asn Gln Thr Pro Ile Gln Pro Lys His Leu Pro Asn Ser Lys Gln Pro
260 265 270

Arg Ser Gln Arg Gly Ser Lys Ala Gln Glu Leu Ile Ala Tyr Leu Gln
275 280 285

Lys Glu Leu Glu Phe Leu Pro Tyr Ser Gln Lys Ala Ile Ala Lys Gln
290 295 300

Val Asp Phe Tyr Arg Pro Ser Ser Ile Ala Tyr Leu Glu Leu Asp Pro
305 310 315 320

Arg Asp Phe Lys Val Thr Glu Glu Trp Gln Lys Glu Asn Leu Lys Ile
325 330 335

Arg Ser Lys Ala Gln Ala Lys Met Leu Glu Met Arg Asn Pro Gln Ala
340 345 350

His Leu Ser Asn Ser Gln Ser Leu Leu Phe Val Gln Lys Ile Phe Ala
355 360 365

Asp Val Asn Lys Glu Ile Glu Ala Val Ala Asn Thr Glu Lys Lys Ala
370 375 380

Glu Lys Ala Gly Tyr Gly Tyr Ser Lys Arg Met
385 390 395

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 70...864
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

TAATAACTCA ATCCCATTTG AATGGCATTT TTAAGCCAAA TTGCTACTAT CTTTGGCTAA 60

AGGTTAAAC ATG ATT AAA CAA ACC CTC ATC ATT CTT GCC CCT TTT TTT ATC 111
Met Ile Lys Gln Thr Leu Ile Ile Leu Ala Pro Phe Phe Ile
1 5 10

GCA ACG CTG TTG TAT TTT TTA GGC GCA CCG GAT GGG TTA AGA CCT AAC 159
Ala Thr Leu Leu Tyr Phe Leu Gly Ala Pro Asp Gly Leu Arg Pro Asn
15 20 25 30

GCT TGG CTT TAT TTT TGT ATT TTC ATG GGC ATG ATT ATA GGG CTA ATT 207
Ala Trp Leu Tyr Phe Cys Ile Phe Met Gly Met Ile Ile Gly Leu Ile
35 40 45

TTA GAG CCG GTG CCA TCA GGT TTA ATA GCG CTA AGC GCG TTA GTG CTG 255
Leu Glu Pro Val Pro Ser Gly Leu Ile Ala Leu Ser Ala Leu Val Leu
50 55 60

TGT ATA GCG TTA AAA ATT GGA GCG AGC GAT AAA GTA GCG AGC GCT AAT 303
Cys Ile Ala Leu Lys Ile Gly Ala Ser Asp Lys Val Ala Ser Ala Asn
65 70 75

AAG GCT ATT TCG TGG GGT TTG AGC GGG TAT GCG AAT AAA ACG GTG TGG 351
Lys Ala Ile Ser Trp Gly Leu Ser Gly Tyr Ala Asn Lys Thr Val Trp
80 85 90

CTT GTG TTT GTC GCT TTC ATT TTG GGT TTA GGG TAT GAA AAA AGC TTG 399
Leu Val Phe Val Ala Phe Ile Leu Gly Leu Gly Tyr Glu Lys Ser Leu
95 100 105 110

TTA GGG AAA CGG ATC GCT CTT TTA CTG ATT AGG TTT TTA GGG CAA ACC 447
Leu Gly Lys Arg Ile Ala Leu Leu Leu Ile Arg Phe Leu Gly Gln Thr
115 120 125

CCT TTA GGT TTA GGC TAT GCG ATT GGT TTG AGC GAA TTG TGT CTA GCC 495
Pro Leu Gly Leu Gly Tyr Ala Ile Gly Leu Ser Glu Leu Cys Leu Ala
130 135 140

CCT TTT ATC CCT AGC AAC TCC GCT AGA AGT GGA GGC ATA CTC TAT CCC 543
Pro Phe Ile Pro Ser Asn Ser Ala Arg Ser Gly Gly Ile Leu Tyr Pro
145 150 155

ATC GTT TCA TCT ATC CCG CCT TTA ATG GGA TCT ACT CCA AAT AAT AAC 591
Ile Val Ser Ser Ile Pro Pro Leu Met Gly Ser Thr Pro Asn Asn Asn
160 165 170

CCT GAC AAA ATC GGC GCG TAT TTG ATG TGG GTC GCT TTG GCT TCA ACT 639
Pro Asp Lys Ile Gly Ala Tyr Leu Met Trp Val Ala Leu Ala Ser Thr
175 180 185 190

TGC ATC ACT TCG TCC ATG TTT TTA ACC GCG CTC GCT CCT AAC CCC CTA 687
Cys Ile Thr Ser Ser Met Phe Leu Thr Ala Leu Ala Pro Asn Pro Leu
195 200 205

GCA ATG GAA ATC GCT GCC AAA ATG GGC GTG AAT GAA ATC TCA TGG TTT 735
Ala Met Glu Ile Ala Ala Lys Met Gly Val Asn Glu Ile Ser Trp Phe
210 215 220

TCG TGG TTT TTA GCG TTC TTG CCT TGT GGG GTG GTT TTG ATC TTG CTT 783
Ser Trp Phe Leu Ala Phe Leu Pro Cys Gly Val Val Leu Ile Leu Leu
225 230 235

GTG CCT TTA TTG GCG TAT AAA ACC TGC AAA CCC ACC TTA AAA GGC TCA 831
Val Pro Leu Leu Ala Tyr Lys Thr Cys Lys Pro Thr Leu Lys Gly Ser
240 245 250

AAA GAA GTG AGT TTG TGG GCC AAA AAA AGG AAT TAGAGGGCAT GGGGAGGTTT 884
Lys Glu Val Ser Leu Trp Ala Lys Lys Arg Asn
255 260 265

TCTTTAAAAG AAATTTTAAT 904

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

Met Ile Lys Gln Thr Leu Ile Ile Leu Ala Pro Phe Phe Ile Ala Thr
1 5 10 15

Leu Leu Tyr Phe Leu Gly Ala Pro Asp Gly Leu Arg Pro Asn Ala Trp
20 25 30

Leu Tyr Phe Cys Ile Phe Met Gly Met Ile Ile Gly Leu Ile Leu Glu
35 40 45

Pro Val Pro Ser Gly Leu Ile Ala Leu Ser Ala Leu Val Leu Cys Ile
50 55 60

Ala Leu Lys Ile Gly Ala Ser Asp Lys Val Ala Ser Ala Asn Lys Ala
65 70 75 80

Ile Ser Trp Gly Leu Ser Gly Tyr Ala Asn Lys Thr Val Trp Leu Val
85 90 95

Phe Val Ala Phe Ile Leu Gly Leu Gly Tyr Glu Lys Ser Leu Leu Gly
100 105 110

Lys Arg Ile Ala Leu Leu Leu Ile Arg Phe Leu Gly Gln Thr Pro Leu
115 120 125

Gly Leu Gly Tyr Ala Ile Gly Leu Ser Glu Leu Cys Leu Ala Pro Phe
130 135 140

Ile Pro Ser Asn Ser Ala Arg Ser Gly Gly Ile Leu Tyr Pro Ile Val
145 150 155 160

Ser Ser Ile Pro Pro Leu Met Gly Ser Thr Pro Asn Asn Asn Pro Asp
165 170 175

Lys Ile Gly Ala Tyr Leu Met Trp Val Ala Leu Ala Ser Thr Cys Ile
180 185 190

Thr Ser Ser Met Phe Leu Thr Ala Leu Ala Pro Asn Pro Leu Ala Met
195 200 205

Glu Ile Ala Ala Lys Met Gly Val Asn Glu Ile Ser Trp Phe Ser Trp
210 215 220

Phe Leu Ala Phe Leu Pro Cys Gly Val Val Leu Ile Leu Leu Val Pro
225 230 235 240

Leu Leu Ala Tyr Lys Thr Cys Lys Pro Thr Leu Lys Gly Ser Lys Glu
245 250 255

Val Ser Leu Trp Ala Lys Lys Arg Asn
260 265

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 152...1069
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

TTTTAAGCGG TTTCCCTAAA ATAGGTTTTT AATCAATTTA ATCCAAAGTT GAATTTATTT 60

TTTGACAATA TTATACTATA ATAACCAATT AGATTGGGGT TTTACTGATT TTTCTTTGTG 120

TGAGCTTTGG CTTAGTTTTG TAAGGAATGA G ATG ATA AAG AGT TGG ACT AAA 172
Met Ile Lys Ser Trp Thr Lys
1 5

AAG TGG TTT TTG ATT TTA TTT TTA ATG GCA AGT TGT TCC AGT TAT TTG 220
Lys Trp Phe Leu Ile Leu Phe Leu Met Ala Ser Cys Ser Ser Tyr Leu
10 15 20

GTG GCT ACA ACC GGT GAG AAA TAT TTT AAA ATG GCT ACT CAA GCC TTT 268
Val Ala Thr Thr Gly Glu Lys Tyr Phe Lys Met Ala Thr Gln Ala Phe
25 30 35

AAG AGA GGG GAC TAC CAT AAA GCG GTG GCT TTT TAT AAG AGG AGC TGT 316
Lys Arg Gly Asp Tyr His Lys Ala Val Ala Phe Tyr Lys Arg Ser Cys
40 45 50 55

AAT TTA AGG GTG GGG GTT GGT TGC ACG AGT TTA GGC TCT ATG TAT GAA 364
Asn Leu Arg Val Gly Val Gly Cys Thr Ser Leu Gly Ser Met Tyr Glu
60 65 70

GAT GGC GAT GGC GTG GAT CAG AAT ATT ACA AAA GCC GTT TTT TAT TAC 412
Asp Gly Asp Gly Val Asp Gln Asn Ile Thr Lys Ala Val Phe Tyr Tyr
75 80 85

AGA AGA GGG TGT AAT TTA AGG AAT CAT CTC GCT TGC GCG AGT CTA GGC 460
Arg Arg Gly Cys Asn Leu Arg Asn His Leu Ala Cys Ala Ser Leu Gly
90 95 100

TCT ATG TAT GAA GAT GGC GAT GGT GTG CAA AAA AAC CTT CCA AAG GCT 508
Ser Met Tyr Glu Asp Gly Asp Gly Val Gln Lys Asn Leu Pro Lys Ala
105 110 115

ATC TAT TAT TAC AGG AGA GGG TGC CAC TTA AAG GGT GGG GTG AGC TGT 556
Ile Tyr Tyr Tyr Arg Arg Gly Cys His Leu Lys Gly Gly Val Ser Cys
120 125 130 135

GGG AGT TTA GGT TTT ATG TAT TTT AAT GGC ACG GGC GTT AAG CAA AAT 604
Gly Ser Leu Gly Phe Met Tyr Phe Asn Gly Thr Gly Val Lys Gln Asn
140 145 150

TAT GCC AAA GCC CTT TTT CTT TCT AAA TAC GCT TGC AGT TTG AAT TAC 652
Tyr Ala Lys Ala Leu Phe Leu Ser Lys Tyr Ala Cys Ser Leu Asn Tyr
155 160 165

GGC ATT AGT TGT AAC TTT GTA GGG TAT ATG TAT AGG AAC GCC AAA GGC 700
Gly Ile Ser Cys Asn Phe Val Gly Tyr Met Tyr Arg Asn Ala Lys Gly
170 175 180

GTA CAG AAG GAT TTG AAA AAA GCC CTT GCG AAT TTT AAA AGA GGG TGC 748
Val Gln Lys Asp Leu Lys Lys Ala Leu Ala Asn Phe Lys Arg Gly Cys
185 190 195

CAT TTG AAA GAC GGA GCG AGT TGT GTG AGC TTG GGA TAC ATG TAT GAA 796
His Leu Lys Asp Gly Ala Ser Cys Val Ser Leu Gly Tyr Met Tyr Glu
200 205 210 215

GTC GGT ATG GAT GTC AAA CAA AAT GGA GAG CAA GCC TTG AAT CTT TAT 844
Val Gly Met Asp Val Lys Gln Asn Gly Glu Gln Ala Leu Asn Leu Tyr
220 225 230

AAA AAG GGT TGT TAT TTA AAA AGG GGG AGC GGT TGT CAT AAT GTG GCG 892
Lys Lys Gly Cys Tyr Leu Lys Arg Gly Ser Gly Cys His Asn Val Ala
235 240 245

GTG ATG TAT TAC ACC GGT AAG GGC GTT CCA AAG GAT TTA GAT AAA GCC 940
Val Met Tyr Tyr Thr Gly Lys Gly Val Pro Lys Asp Leu Asp Lys Ala
250 255 260

ATT TCG TAT TAT AAG AAA GGT TGC ACT CTA GGC TTT AGT GGT AGC TGT 988
Ile Ser Tyr Tyr Lys Lys Gly Cys Thr Leu Gly Phe Ser Gly Ser Cys
265 270 275

AAA GTG TTA GAA GAA GTG ATT GGC AAG AAG TCT GAT GAT TTG CAA GAT 1036
Lys Val Leu Glu Glu Val Ile Gly Lys Lys Ser Asp Asp Leu Gln Asp
280 285 290 295

GAC GCG CAA AAC GAC ACG CAA GAT GAT ATG CAA TAAGTTAAAG CTTATGGACT 1089
Asp Ala Gln Asn Asp Thr Gln Asp Asp Met Gln
300 305

AATGATTAAA ACTCATCTTA TAGAAATCTT TCTACTCTCT TGTTATCAAA TAGGGATTAA 1149

GCGTCTCTAT TGATGGGTAT TGAGACTAAA AATCTGCAAA TCTAG 1194

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:

Met Ile Lys Ser Trp Thr Lys Lys Trp Phe Leu Ile Leu Phe Leu Met
1 5 10 15

Ala Ser Cys Ser Ser Tyr Leu Val Ala Thr Thr Gly Glu Lys Tyr Phe
20 25 30

Lys Met Ala Thr Gln Ala Phe Lys Arg Gly Asp Tyr His Lys Ala Val
35 40 45

Ala Phe Tyr Lys Arg Ser Cys Asn Leu Arg Val Gly Val Gly Cys Thr
50 55 60

Ser Leu Gly Ser Met Tyr Glu Asp Gly Asp Gly Val Asp Gln Asn Ile
65 70 75 80

Thr Lys Ala Val Phe Tyr Tyr Arg Arg Gly Cys Asn Leu Arg Asn His
85 90 95

Leu Ala Cys Ala Ser Leu Gly Ser Met Tyr Glu Asp Gly Asp Gly Val
100 105 110

Gln Lys Asn Leu Pro Lys Ala Ile Tyr Tyr Tyr Arg Arg Gly Cys His
115 120 125

Leu Lys Gly Gly Val Ser Cys Gly Ser Leu Gly Phe Met Tyr Phe Asn
130 135 140

Gly Thr Gly Val Lys Gln Asn Tyr Ala Lys Ala Leu Phe Leu Ser Lys
145 150 155 160

Tyr Ala Cys Ser Leu Asn Tyr Gly Ile Ser Cys Asn Phe Val Gly Tyr
165 170 175

Met Tyr Arg Asn Ala Lys Gly Val Gln Lys Asp Leu Lys Lys Ala Leu
180 185 190

Ala Asn Phe Lys Arg Gly Cys His Leu Lys Asp Gly Ala Ser Cys Val
195 200 205

Ser Leu Gly Tyr Met Tyr Glu Val Gly Met Asp Val Lys Gln Asn Gly
210 215 220

Glu Gln Ala Leu Asn Leu Tyr Lys Lys Gly Cys Tyr Leu Lys Arg Gly
225 230 235 240

Ser Gly Cys His Asn Val Ala Val Met Tyr Tyr Thr Gly Lys Gly Val
245 250 255

Pro Lys Asp Leu Asp Lys Ala Ile Ser Tyr Tyr Lys Lys Gly Cys Thr
260 265 270

Leu Gly Phe Ser Gly Ser Cys Lys Val Leu Glu Glu Val Ile Gly Lys
275 280 285

Lys Ser Asp Asp Leu Gln Asp Asp Ala Gln Asn Asp Thr Gln Asp Asp
290 295 300

Met Gln
305

(2) INFORMATION FOR SEQ ID NO: 79: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 101...865
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:



TTTGTTTATA AGAAAAATTA TTTCAAATGT AGTAGAATTA AGGCAGTGTT TTTGCGTCAA 60

GCGATTTTAG GTTAATTTTG AGTTTTTAGG AGCAGTTTTT ATG CAA CAA GAA GAG 115
Met Gln Gln Glu Glu
1 5

ATT ATA GAG GGT TAT TAT GGT GCT AGC AAA GGG CTT AAA AAG AGC GGT 163
Ile Ile Glu Gly Tyr Tyr Gly Ala Ser Lys Gly Leu Lys Lys Ser Gly
10 15 20

ATT TAT GCC AAG CTG GAT TTT TTA CAG AGC GCT ACG GGC TTG ATT TTA 211
Ile Tyr Ala Lys Leu Asp Phe Leu Gln Ser Ala Thr Gly Leu Ile Leu
25 30 35

GCG CTC TTT ATG ATA GCA CAC ATG TTT TTA GTC TCA AGT ATC TTG ATT 259
Ala Leu Phe Met Ile Ala His Met Phe Leu Val Ser Ser Ile Leu Ile
40 45 50

AGC GAT GAA GCC ATG TAT AAA GTG GCG AAA TTT TTT GAA GGG AGC TTG 307
Ser Asp Glu Ala Met Tyr Lys Val Ala Lys Phe Phe Glu Gly Ser Leu
55 60 65

TTT TTA AAA GCG GGC GAG CCG GCT ATT GTG AGC GTG GTT GCA GCA GGG 355
Phe Leu Lys Ala Gly Glu Pro Ala Ile Val Ser Val Val Ala Ala Gly
70 75 80 85

ATT ATT CTT ATT TTA GTC GCG CAT GCT TTT TTG GCG TTA AGG AAA TTC 403
Ile Ile Leu Ile Leu Val Ala His Ala Phe Leu Ala Leu Arg Lys Phe
90 95 100

CCT ATC AAT TAC AGG CAA TAC AAG GTT TTT AAA ACC CAT AAG CAT TTG 451
Pro Ile Asn Tyr Arg Gln Tyr Lys Val Phe Lys Thr His Lys His Leu
105 110 115

ATG AAA CAT GGC GAT ACG AGC TTG TGG TTT ATT CAA GCC CTC ACC GGG 499
Met Lys His Gly Asp Thr Ser Leu Trp Phe Ile Gln Ala Leu Thr Gly
120 125 130

TTT GCG ATG TTT TTC TTA GCG AGT ATC CAC TTA TTT GTC ATG CTC ACA 547
Phe Ala Met Phe Phe Leu Ala Ser Ile His Leu Phe Val Met Leu Thr
135 140 145

GAG CCT GAA AGT ATT GGG CCT CAT GGT TCA AGC TAT CGT TTT GTA ACG 595
Glu Pro Glu Ser Ile Gly Pro His Gly Ser Ser Tyr Arg Phe Val Thr
150 155 160 165

CAA AAC TTT TGG CTT TTG TAT ATT TTC TTA TTG TTT GCC GTA GAA TTG 643
Gln Asn Phe Trp Leu Leu Tyr Ile Phe Leu Leu Phe Ala Val Glu Leu
170 175 180

CAT GGC TCT ATT GGG TTG TAT CGT TTA GCG ATC AAA TGG GGG TGG TTT 691
His Gly Ser Ile Gly Leu Tyr Arg Leu Ala Ile Lys Trp Gly Trp Phe
185 190 195

AAA AAT GTG AGC ATT CAA GGT TTG AGA AAA GTC AAA TGG GCG ATG AGC 739
Lys Asn Val Ser Ile Gln Gly Leu Arg Lys Val Lys Trp Ala Met Ser
200 205 210

GTG TTT TTT ATT GTT TTA GGG CTT TGC ACC TAT GGG GCT TAC ATT AAA 787
Val Phe Phe Ile Val Leu Gly Leu Cys Thr Tyr Gly Ala Tyr Ile Lys
215 220 225

AAA GGT TTA GAA AAT AAG GAA AAT GGC ATT AAA ACC ATG CAA GAA GCC 835
Lys Gly Leu Glu Asn Lys Glu Asn Gly Ile Lys Thr Met Gln Glu Ala
230 235 240 245

ATA GAA GCT GAT GGG AAA TTC CAC AAA GAA TAAGGGTAGA AAATGAAAAT AAC 888
Ile Glu Ala Asp Gly Lys Phe His Lys Glu
250 255

ATATTGTGAT GCGCTAATTA TTGGAGGCGG ACTAGCTGGG TTAAGGGCTA GTATCGCATG 948

CAAACAAAAG GGTTTAAACA CCATCGTTTT AAGCCTAGTG CCTGTCAGGC GTT 1001

(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 255 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

Met Gln Gln Glu Glu Ile Ile Glu Gly Tyr Tyr Gly Ala Ser Lys Gly
1 5 10 15

Leu Lys Lys Ser Gly Ile Tyr Ala Lys Leu Asp Phe Leu Gln Ser Ala
20 25 30

Thr Gly Leu Ile Leu Ala Leu Phe Met Ile Ala His Met Phe Leu Val
35 40 45

Ser Ser Ile Leu Ile Ser Asp Glu Ala Met Tyr Lys Val Ala Lys Phe
50 55 60

Phe Glu Gly Ser Leu Phe Leu Lys Ala Gly Glu Pro Ala Ile Val Ser
65 70 75 80

Val Val Ala Ala Gly Ile Ile Leu Ile Leu Val Ala His Ala Phe Leu
85 90 95

Ala Leu Arg Lys Phe Pro Ile Asn Tyr Arg Gln Tyr Lys Val Phe Lys
100 105 110

Thr His Lys His Leu Met Lys His Gly Asp Thr Ser Leu Trp Phe Ile
115 120 125

Gln Ala Leu Thr Gly Phe Ala Met Phe Phe Leu Ala Ser Ile His Leu
130 135 140

Phe Val Met Leu Thr Glu Pro Glu Ser Ile Gly Pro His Gly Ser Ser
145 150 155 160

Tyr Arg Phe Val Thr Gln Asn Phe Trp Leu Leu Tyr Ile Phe Leu Leu
165 170 175

Phe Ala Val Glu Leu His Gly Ser Ile Gly Leu Tyr Arg Leu Ala Ile
180 185 190

Lys Trp Gly Trp Phe Lys Asn Val Ser Ile Gln Gly Leu Arg Lys Val
195 200 205

Lys Trp Ala Met Ser Val Phe Phe Ile Val Leu Gly Leu Cys Thr Tyr
210 215 220

Gly Ala Tyr Ile Lys Lys Gly Leu Glu Asn Lys Glu Asn Gly Ile Lys
225 230 235 240

Thr Met Gln Glu Ala Ile Glu Ala Asp Gly Lys Phe His Lys Glu
245 250 255

(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 975 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 82...912
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



TTTTAAAATT AAAGAAAATT TTTTTTAAAG ATTATCACTC TTTTTTGATA AAGTAATCAT 60

TTAAAATTTA GGGAGTTTTT T ATG GAA GAA TCA ACA GCG TTT ATT TTG GCT 111
Met Glu Glu Ser Thr Ala Phe Ile Leu Ala
1 5 10

CTT GTG GGG CTA TTC ACC GGC ATT ACC GCC GGG TTT TTT GGT ATT GGT 159
Leu Val Gly Leu Phe Thr Gly Ile Thr Ala Gly Phe Phe Gly Ile Gly
15 20 25

GGG GGG GAG ATT GTC GTC CCT AGC GCG ATT TTT GCC CAT TTT AGC TAT 207
Gly Gly Glu Ile Val Val Pro Ser Ala Ile Phe Ala His Phe Ser Tyr
30 35 40

AGC CAT GCG GTG GGT ATT TCG CTC ATG CAA ATG CTT TTT TCT TCA GTG 255
Ser His Ala Val Gly Ile Ser Leu Met Gln Met Leu Phe Ser Ser Val
45 50 55

GTC GGC TCT ATC ATC AAT TAC AAA AAG GGC TTA TTG GAT TTG AGA GAA 303
Val Gly Ser Ile Ile Asn Tyr Lys Lys Gly Leu Leu Asp Leu Arg Glu
60 65 70

GGC TCA TTT GCC GCG CTT GGA GGG CTA ATG GGA GCG ATT TTA GGG AGC 351
Gly Ser Phe Ala Ala Leu Gly Gly Leu Met Gly Ala Ile Leu Gly Ser
75 80 85 90

TTT ATC TTA AAA ATC ATT GAC GAT AAA ATT TTA ATG GCG GTG TTT GTG 399
Phe Ile Leu Lys Ile Ile Asp Asp Lys Ile Leu Met Ala Val Phe Val
95 100 105

GTG GTG GTG TGC TAC ACC TTT ATC AAA TAC GCT TTT TCT AGC AAC AAG 447
Val Val Val Cys Tyr Thr Phe Ile Lys Tyr Ala Phe Ser Ser Asn Lys
110 115 120

AAA CCC AAG CAT TTT GAA GAA ATG CAT TTT GAT TTG CAT GCG AAT AAC 495
Lys Pro Lys His Phe Glu Glu Met His Phe Asp Leu His Ala Asn Asn
125 130 135

AAA ACG CCC GAA AAA AAG CGC GCA ATC CCT TTT GTG TCT ATG GAT AGA 543
Lys Thr Pro Glu Lys Lys Arg Ala Ile Pro Phe Val Ser Met Asp Arg
140 145 150

ACG CAT GGG GTT TTG ATG CTC GCC GGT TTT GTT ACC GGC ATC TTT TCT 591
Thr His Gly Val Leu Met Leu Ala Gly Phe Val Thr Gly Ile Phe Ser
155 160 165 170

ATC CCA CTA GGC ATG GGT GGA GGG ATT TTA ATG GTG CCG TTT TTG GGC 639
Ile Pro Leu Gly Met Gly Gly Gly Ile Leu Met Val Pro Phe Leu Gly
175 180 185

TAT TTT TTG AAA TAC GAT TCT AAA AAA ATC GTG CCT TTG GGG CTA TTT 687
Tyr Phe Leu Lys Tyr Asp Ser Lys Lys Ile Val Pro Leu Gly Leu Phe
190 195 200

TTT GTG GTG TTC GCT TCT TTA TCT GGG GTC ATC TCT CTT TAT AAC GGG 735
Phe Val Val Phe Ala Ser Leu Ser Gly Val Ile Ser Leu Tyr Asn Gly
205 210 215

AGG GTT CTT GAT AAT ATA AGC GTT CAA GCG GGG GTG ATT ACC GGC ATT 783
Arg Val Leu Asp Asn Ile Ser Val Gln Ala Gly Val Ile Thr Gly Ile
220 225 230

GGA GCG TTT TTA GGC GTG GGC ATT GGC ATC AAG CTT ATC GCT TTG GCT 831
Gly Ala Phe Leu Gly Val Gly Ile Gly Ile Lys Leu Ile Ala Leu Ala
235 240 245 250

AAT GAA AAG GTG CAT AAA ATC CTG TTG CTC CTT ATT TAT GCT TTA AGC 879
Asn Glu Lys Val His Lys Ile Leu Leu Leu Leu Ile Tyr Ala Leu Ser
255 260 265

ATT TTA GCG ACT TTA CAC AAG CTC ATT ATG GGG TAAATCTAAA AACGCTTCTA 932
Ile Leu Ala Thr Leu His Lys Leu Ile Met Gly
270 275

GGGCATTTTT AAAATTAATA TCAAAGAGCT TTCACCAGCA AGC 975

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

Met Glu Glu Ser Thr Ala Phe Ile Leu Ala Leu Val Gly Leu Phe Thr
1 5 10 15

Gly Ile Thr Ala Gly Phe Phe Gly Ile Gly Gly Gly Glu Ile Val Val
20 25 30

Pro Ser Ala Ile Phe Ala His Phe Ser Tyr Ser His Ala Val Gly Ile
35 40 45

Ser Leu Met Gln Met Leu Phe Ser Ser Val Val Gly Ser Ile Ile Asn
50 55 60

Tyr Lys Lys Gly Leu Leu Asp Leu Arg Glu Gly Ser Phe Ala Ala Leu
65 70 75 80

Gly Gly Leu Met Gly Ala Ile Leu Gly Ser Phe Ile Leu Lys Ile Ile
85 90 95

Asp Asp Lys Ile Leu Met Ala Val Phe Val Val Val Val Cys Tyr Thr
100 105 110

Phe Ile Lys Tyr Ala Phe Ser Ser Asn Lys Lys Pro Lys His Phe Glu
115 120 125

Glu Met His Phe Asp Leu His Ala Asn Asn Lys Thr Pro Glu Lys Lys
130 135 140

Arg Ala Ile Pro Phe Val Ser Met Asp Arg Thr His Gly Val Leu Met
145 150 155 160

Leu Ala Gly Phe Val Thr Gly Ile Phe Ser Ile Pro Leu Gly Met Gly
165 170 175

Gly Gly Ile Leu Met Val Pro Phe Leu Gly Tyr Phe Leu Lys Tyr Asp
180 185 190

Ser Lys Lys Ile Val Pro Leu Gly Leu Phe Phe Val Val Phe Ala Ser
195 200 205

Leu Ser Gly Val Ile Ser Leu Tyr Asn Gly Arg Val Leu Asp Asn Ile
210 215 220

Ser Val Gln Ala Gly Val Ile Thr Gly Ile Gly Ala Phe Leu Gly Val
225 230 235 240

Gly Ile Gly Ile Lys Leu Ile Ala Leu Ala Asn Glu Lys Val His Lys
245 250 255

Ile Leu Leu Leu Leu Ile Tyr Ala Leu Ser Ile Leu Ala Thr Leu His
260 265 270

Lys Leu Ile Met Gly
275

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1667 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 220...1482
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

AAGCGCGAGC TATATGAGGA ATTTTAGCTT CTATGTGGGC TATTCAGTCG GTTTTTAAGG 60

AAGGCTCTTG ATGAAAAATA CCAATACAAA AGAGATAAAG AATACAAGAA TGAAAAAAGG 120

TTATAGTCAA TACCACGCGC TCAAAAAAGG GCTTTTAAAA ACGCTCTGCT TTTTAGCCTT 180

CCTTTAAGCG TGGCGTTAGC TGAAGACGAT GGCTTTTAT ATG GGA GTG GGC TAT 234
Met Gly Val Gly Tyr
1 5

CAA ATC GGC GGC GCG CAA CAA AAT ATC GAT AAC AAA GGC AGC ACC CTA 282
Gln Ile Gly Gly Ala Gln Gln Asn Ile Asp Asn Lys Gly Ser Thr Leu
10 15 20

AGG AAT AAT GTC ATT AAT AAT TTC CGC CAA GTG GGC GTG GGT ATG GCA 330
Arg Asn Asn Val Ile Asn Asn Phe Arg Gln Val Gly Val Gly Met Ala
25 30 35

GGG GGT AAT GGG CTT TTA GCC TTA GCG ACA AAC ACG ACC ATG GAC GCT 378
Gly Gly Asn Gly Leu Leu Ala Leu Ala Thr Asn Thr Thr Met Asp Ala
40 45 50

CTT TTA GGG ATA GGC AAC CAA ATT GTC AAT ACT AAT ACA ACT GTT AGC 426
Leu Leu Gly Ile Gly Asn Gln Ile Val Asn Thr Asn Thr Thr Val Ser
55 60 65

AAC AAC AAC GCA GAA TTA ACC CAG TTT AAA AAA ATA CTC CCT CAA ATT 474
Asn Asn Asn Ala Glu Leu Thr Gln Phe Lys Lys Ile Leu Pro Gln Ile
70 75 80 85

GAG CAA CGC TTT GAA ACG AAT AAA AAC GCT TAT AGC GTT CAA GCC TTG 522
Glu Gln Arg Phe Glu Thr Asn Lys Asn Ala Tyr Ser Val Gln Ala Leu
90 95 100

CAA GTG TAT TTG AGT AAT GTG CTT TAT AAC TTG GTT AAT AAT AGT AAT 570
Gln Val Tyr Leu Ser Asn Val Leu Tyr Asn Leu Val Asn Asn Ser Asn
105 110 115

AAT GGC AGT AAT AAT GGA GTC GTT CCT GAA TAT GTA GGA ATT ATA AAA 618
Asn Gly Ser Asn Asn Gly Val Val Pro Glu Tyr Val Gly Ile Ile Lys
120 125 130

GTT CTC TAT GGT TCT CAA AAT GAA TTC AGT CTC TTA GCC ACG GAG AGT 666
Val Leu Tyr Gly Ser Gln Asn Glu Phe Ser Leu Leu Ala Thr Glu Ser
135 140 145

GTG GTG CTT TTA AAC GCG CTT ACA AGG GTG AAT CTG GAT AGT AAT TCG 714
Val Val Leu Leu Asn Ala Leu Thr Arg Val Asn Leu Asp Ser Asn Ser
150 155 160 165

GTG TTT TTA AAA GGG CTA TTA GCC CAA ATG CAG CTT TTT AAT GAC ACT 762
Val Phe Leu Lys Gly Leu Leu Ala Gln Met Gln Leu Phe Asn Asp Thr
170 175 180

TCT TCA GCA AAG CTA GGC CAG ATC GCA GAA AAC TTG AAG AAC GGT GGT 810
Ser Ser Ala Lys Leu Gly Gln Ile Ala Glu Asn Leu Lys Asn Gly Gly
185 190 195

GCA GGA TCA ATG CTC CAA AAG GAT GTG AAA ACC ATC TCG GAT CGA ATC 858
Ala Gly Ser Met Leu Gln Lys Asp Val Lys Thr Ile Ser Asp Arg Ile
200 205 210

GCT ACT TAC CAA GAG AAT CTA AAA CAG CTA GGA GGG ATG CTA AAG AAT 906
Ala Thr Tyr Gln Glu Asn Leu Lys Gln Leu Gly Gly Met Leu Lys Asn
215 220 225

TAC GAT GAA CCC TAC TTG CCC CAA TTT GGG CCA GGC ACA AGC TCT CAG 954
Tyr Asp Glu Pro Tyr Leu Pro Gln Phe Gly Pro Gly Thr Ser Ser Gln
230 235 240 245

CAT GGG GTT ATT AAT GGC TTT GGC ATT CAA GTG GGC TAT AAG CAA TTT 1002
His Gly Val Ile Asn Gly Phe Gly Ile Gln Val Gly Tyr Lys Gln Phe
250 255 260

TTT GGG AAC AAG CGG AAT ATA GGC TTA CGA TAT TAC GCT TTC TTT GAT 1050
Phe Gly Asn Lys Arg Asn Ile Gly Leu Arg Tyr Tyr Ala Phe Phe Asp
265 270 275

TAT GGC TTT ACG CAA TTG GGC AGT CTT AGC AGC GCC GTT AAA GCG AAT 1098
Tyr Gly Phe Thr Gln Leu Gly Ser Leu Ser Ser Ala Val Lys Ala Asn
280 285 290

ATC TTT ACT TAT GGC GCT GGC ACG GAC TTT TTA TGG AAT ATC TTT AGA 1146
Ile Phe Thr Tyr Gly Ala Gly Thr Asp Phe Leu Trp Asn Ile Phe Arg
295 300 305

AGG GTT TTT AGC GAT CAG TCC TTG AAT GTG GGG GTG TTT GGG GGC ATT 1194
Arg Val Phe Ser Asp Gln Ser Leu Asn Val Gly Val Phe Gly Gly Ile
310 315 320 325

CAA ATA GCG GGT AAC ACT TGG GAT AGC TCT TTA AGA GGT CAA ATT GAA 1242
Gln Ile Ala Gly Asn Thr Trp Asp Ser Ser Leu Arg Gly Gln Ile Glu
330 335 340

AAC TCG TTT AAA GAA TAC CCC ACT CCC ACG AAT TTC CAA TTT TTG TTT 1290
Asn Ser Phe Lys Glu Tyr Pro Thr Pro Thr Asn Phe Gln Phe Leu Phe
345 350 355

AAT TTG GGT TTA AGG GCT CAT TTT GCC AGC ACC ATG CAC CGC CGG TTT 1338
Asn Leu Gly Leu Arg Ala His Phe Ala Ser Thr Met His Arg Arg Phe
360 365 370

TTG AGC GCG TCT CAA AGC ATT CAG CAT GGG ATG GAA TTT GGC GTG AAA 1386
Leu Ser Ala Ser Gln Ser Ile Gln His Gly Met Glu Phe Gly Val Lys
375 380 385

ATC CCG GCT ATC AAT CAA AGG TAT TTG AGG GCC AAT GGG GCT GAT GTG 1434
Ile Pro Ala Ile Asn Gln Arg Tyr Leu Arg Ala Asn Gly Ala Asp Val
390 395 400 405

GAT TAC AGG CGT TTG TAT GCG TTC TAT ATC AAT TAC ACG ATA GGT TTT T 1483
Asp Tyr Arg Arg Leu Tyr Ala Phe Tyr Ile Asn Tyr Thr Ile Gly Phe
410 415 420

AAGCTCTTTT TAGGGCTTAT AAAGAGGCTT TTTACTTTTT TTTTGGTATT CTAACAAGCT 1543

TTTAAATAAT CCAATCTACT TTGTTTTAAG GATAATATTT TATGGCAGAT GTCGTTGTGG 1603

GGATCCAGTG GGGAGATGAG GGGAAGGGAA AAATTGTTGA TAGGATCGCT AAAGATTATG 1663

ACTT 1667

(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 421 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

Met Gly Val Gly Tyr Gln Ile Gly Gly Ala Gln Gln Asn Ile Asp Asn
1 5 10 15

Lys Gly Ser Thr Leu Arg Asn Asn Val Ile Asn Asn Phe Arg Gln Val
20 25 30

Gly Val Gly Met Ala Gly Gly Asn Gly Leu Leu Ala Leu Ala Thr Asn
35 40 45

Thr Thr Met Asp Ala Leu Leu Gly Ile Gly Asn Gln Ile Val Asn Thr
50 55 60

Asn Thr Thr Val Ser Asn Asn Asn Ala Glu Leu Thr Gln Phe Lys Lys
65 70 75 80

Ile Leu Pro Gln Ile Glu Gln Arg Phe Glu Thr Asn Lys Asn Ala Tyr
85 90 95

Ser Val Gln Ala Leu Gln Val Tyr Leu Ser Asn Val Leu Tyr Asn Leu
100 105 110

Val Asn Asn Ser Asn Asn Gly Ser Asn Asn Gly Val Val Pro Glu Tyr
115 120 125

Val Gly Ile Ile Lys Val Leu Tyr Gly Ser Gln Asn Glu Phe Ser Leu
130 135 140

Leu Ala Thr Glu Ser Val Val Leu Leu Asn Ala Leu Thr Arg Val Asn
145 150 155 160

Leu Asp Ser Asn Ser Val Phe Leu Lys Gly Leu Leu Ala Gln Met Gln
165 170 175

Leu Phe Asn Asp Thr Ser Ser Ala Lys Leu Gly Gln Ile Ala Glu Asn
180 185 190

Leu Lys Asn Gly Gly Ala Gly Ser Met Leu Gln Lys Asp Val Lys Thr
195 200 205

Ile Ser Asp Arg Ile Ala Thr Tyr Gln Glu Asn Leu Lys Gln Leu Gly
210 215 220

Gly Met Leu Lys Asn Tyr Asp Glu Pro Tyr Leu Pro Gln Phe Gly Pro
225 230 235 240

Gly Thr Ser Ser Gln His Gly Val Ile Asn Gly Phe Gly Ile Gln Val
245 250 255

Gly Tyr Lys Gln Phe Phe Gly Asn Lys Arg Asn Ile Gly Leu Arg Tyr
260 265 270

Tyr Ala Phe Phe Asp Tyr Gly Phe Thr Gln Leu Gly Ser Leu Ser Ser
275 280 285

Ala Val Lys Ala Asn Ile Phe Thr Tyr Gly Ala Gly Thr Asp Phe Leu
290 295 300

Trp Asn Ile Phe Arg Arg Val Phe Ser Asp Gln Ser Leu Asn Val Gly
305 310 315 320

Val Phe Gly Gly Ile Gln Ile Ala Gly Asn Thr Trp Asp Ser Ser Leu
325 330 335

Arg Gly Gln Ile Glu Asn Ser Phe Lys Glu Tyr Pro Thr Pro Thr Asn
340 345 350

Phe Gln Phe Leu Phe Asn Leu Gly Leu Arg Ala His Phe Ala Ser Thr
355 360 365

Met His Arg Arg Phe Leu Ser Ala Ser Gln Ser Ile Gln His Gly Met
370 375 380

Glu Phe Gly Val Lys Ile Pro Ala Ile Asn Gln Arg Tyr Leu Arg Ala
385 390 395 400

Asn Gly Ala Asp Val Asp Tyr Arg Arg Leu Tyr Ala Phe Tyr Ile Asn
405 410 415

Tyr Thr Ile Gly Phe
420

(2) INFORMATION FOR SEQ ID NO: 85:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 926 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 207...746
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:

CCCCTTAATT GCAGATGTTT TGCAAGAGGG ATTGCGTGGC GTCTATCATT CTAGAGAGAT 60
AGACTTTGTA GAAAAAGTGG TTGTTTTAGA CAGCTGTCAA ATCCACCAAA AAGCGTTAAT 120
GCATTTGCAA GAAACTTTGA TGATAGAAGT GGATAGGCTT GATTTTTCTT TAGTGGAGCG 180

CTTGAACATT TTAGCGCGCA TGGAGA ATG AAA AGC ATG CGT TTT AGT TAC ATT 233
Met Lys Ser Met Arg Phe Ser Tyr Ile
1 5

GAG CCA AGA GCG AAA TAC CTT ATC AGC AAG CTT TCT AAA ATT TGG GTT 281
Glu Pro Arg Ala Lys Tyr Leu Ile Ser Lys Leu Ser Lys Ile Trp Val
10 15 20 25

TTT TAC ATT TTT TTA TCT TTT GTG GTA ATA GGG GGG TTA GTG TGG TTT 329
Phe Tyr Ile Phe Leu Ser Phe Val Val Ile Gly Gly Leu Val Trp Phe
30 35 40

ATG CAC AAC GCC ATT AAA AGC ACT CAA GAC AAC GCG TCC AGT TTG ACG 377
Met His Asn Ala Ile Lys Ser Thr Gln Asp Asn Ala Ser Ser Leu Thr
45 50 55

ATC CAA GAA AGG CTC TAC CGC CAT GAA ATC AGC CGC TTA CAG GTT AAG 425
Ile Gln Glu Arg Leu Tyr Arg His Glu Ile Ser Arg Leu Gln Val Lys
60 65 70

ACT GAT GAA ACC TTA AAA CTC ATT AAA GAA GCC AAA AAG CGT TTG AAT 473
Thr Asp Glu Thr Leu Lys Leu Ile Lys Glu Ala Lys Lys Arg Leu Asn
75 80 85

TAT AAC GAT GAT ATA CGA GAT GTT TTG CAA GGG CTT TTG AAT ATT GTG 521
Tyr Asn Asp Asp Ile Arg Asp Val Leu Gln Gly Leu Leu Asn Ile Val
90 95 100 105

CCG GAT TCC ATC ACT ATT AAT AGC ATT GAA ATA GAC CAG CAA AGC GTG 569
Pro Asp Ser Ile Thr Ile Asn Ser Ile Glu Ile Asp Gln Gln Ser Val
110 115 120

GTT GTT AGC GGT AAA ACC CCT TCT AAA GAA GCC TTT TAT TTT TTG TTT 617
Val Val Ser Gly Lys Thr Pro Ser Lys Glu Ala Phe Tyr Phe Leu Phe
125 130 135

CAA AAC AAA CTA AAC CCC ATG TTT GAT TAT TCT AGG GCG GAA TTT TTC 665
Gln Asn Lys Leu Asn Pro Met Phe Asp Tyr Ser Arg Ala Glu Phe Phe
140 145 150

CCC TTA AGC GAT GGG TGG TTT AAT TTT GTC TCC ACC AAC TTT TCT AAT 713
Pro Leu Ser Asp Gly Trp Phe Asn Phe Val Ser Thr Asn Phe Ser Asn
155 160 165

TCC TTA CTG ATA AAA AAT CCG GAG TCT ATT AAA TGAAGCCATT GCATTTTTCA 766
Ser Leu Leu Ile Lys Asn Pro Glu Ser Ile Lys
170 175 180

CACCTGGACA GAGAGCAATC AGGCGATGTG GGGTTTATCA TTAAAAACCT CGTTTTTTTA 826
GGGGTTTTTT CCTTATTGGG TTGGTTGAAT ACCGAGTATT TTCTATGGCC TAGCATGCTG 886
GAATTAAAAA AAATCCTTTT AGAAGAAAAT CGTAAAAAAA 926

(2) INFORMATION FOR SEQ ID NO: 86:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 180 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

Met Lys Ser Met Arg Phe Ser Tyr Ile Glu Pro Arg Ala Lys Tyr Leu
1 5 10 15

Ile Ser Lys Leu Ser Lys Ile Trp Val Phe Tyr Ile Phe Leu Ser Phe
20 25 30

Val Val Ile Gly Gly Leu Val Trp Phe Met His Asn Ala Ile Lys Ser
35 40 45

Thr Gln Asp Asn Ala Ser Ser Leu Thr Ile Gln Glu Arg Leu Tyr Arg
50 55 60

His Glu Ile Ser Arg Leu Gln Val Lys Thr Asp Glu Thr Leu Lys Leu
65 70 75 80

Ile Lys Glu Ala Lys Lys Arg Leu Asn Tyr Asn Asp Asp Ile Arg Asp
85 90 95

Val Leu Gln Gly Leu Leu Asn Ile Val Pro Asp Ser Ile Thr Ile Asn
100 105 110

Ser Ile Glu Ile Asp Gln Gln Ser Val Val Val Ser Gly Lys Thr Pro
115 120 125

Ser Lys Glu Ala Phe Tyr Phe Leu Phe Gln Asn Lys Leu Asn Pro Met
130 135 140

Phe Asp Tyr Ser Arg Ala Glu Phe Phe Pro Leu Ser Asp Gly Trp Phe
145 150 155 160

Asn Phe Val Ser Thr Asn Phe Ser Asn Ser Leu Leu Ile Lys Asn Pro
165 170 175

Glu Ser Ile Lys
180



(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1440 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 151...1299
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

AGCACTTTCG CTTTTCATTG TTTTGATGCG ACTTCTAGTT TCAGGCTTTT GCAAGTGTTA 60
AACGATGAGG TGAGCGATGC GTTTTTAATC ATACAAGATT TTAAAGAACA GCGCATCATT 120

CATAAAATCA TTCAAACCCA TTTCAAACGC ATG TGC GTG GTT TTG AGC GTG AAA 174
Met Cys Val Val Leu Ser Val Lys
1 5

AGA GAT GGT GAA AAA ACT TTA GAA AAT AAT GAA GAA AAT AAA GAT GAA 222
Arg Asp Gly Glu Lys Thr Leu Glu Asn Asn Glu Glu Asn Lys Asp Glu
10 15 20

AAG CTT ATT TTG ATT GAT GAA TTT GAA GTT TTA GCC AAT AAA TTC ATT 270
Lys Leu Ile Leu Ile Asp Glu Phe Glu Val Leu Ala Asn Lys Phe Ile
25 30 35 40

TCT CGT TTG CCC AAT ATC CCT AGC ACC CCT AGA GAG TTT GGG TTA GGC 318
Ser Arg Leu Pro Asn Ile Pro Ser Thr Pro Arg Glu Phe Gly Leu Gly
45 50 55

AAG GGC GAG ATC ATG GAG ATT GAT GTG CCT TTT GGG AGT ATT TTT GCT 366
Lys Gly Glu Ile Met Glu Ile Asp Val Pro Phe Gly Ser Ile Phe Ala
60 65 70

TAC AGA CAC ATT GGC TCT ATC AGA CAA AAA GAA TAC AGG ATT GTA GGG 414
Tyr Arg His Ile Gly Ser Ile Arg Gln Lys Glu Tyr Arg Ile Val Gly
75 80 85

CTT TAT CGC AAC GAT GTT TTG TTG CTC TCC ACT AAA TCT TTA GTT ATC 462
Leu Tyr Arg Asn Asp Val Leu Leu Leu Ser Thr Lys Ser Leu Val Ile
90 95 100

CAG CCG CGA GAC ATT CTC TTA GTG GCG GGT AAT CCG GAA ATT TTG AAT 510
Gln Pro Arg Asp Ile Leu Leu Val Ala Gly Asn Pro Glu Ile Leu Asn
105 110 115 120

GCG GTG TAT CTT CAA GTC AAA AGC AAT GTG GGG CAG TTC CCA GCC CCC 558
Ala Val Tyr Leu Gln Val Lys Ser Asn Val Gly Gln Phe Pro Ala Pro
125 130 135

TTT GGT AAG AGC ATT TAT TTA TAC ATT GAT ATG CGT TTG CAG AAC AGA 606
Phe Gly Lys Ser Ile Tyr Leu Tyr Ile Asp Met Arg Leu Gln Asn Arg
140 145 150

AAA GCG ATG ATG CGC GAT GTG TAT CAA GCC TTG TTT TTG CAC AAA CAT 654
Lys Ala Met Met Arg Asp Val Tyr Gln Ala Leu Phe Leu His Lys His
155 160 165

TTA AAG AGC TAC AAG CTC TAC ATT CAG GTT TTA CAC CCC ACT AGC CCT 702
Leu Lys Ser Tyr Lys Leu Tyr Ile Gln Val Leu His Pro Thr Ser Pro
170 175 180

AAG TTT TAC CAT AAA TTT TTA GCG CTA GAA ACC GAA AGC ATT GAA GTG 750
Lys Phe Tyr His Lys Phe Leu Ala Leu Glu Thr Glu Ser Ile Glu Val
185 190 195 200

AAT TTT GAT TTT TAC AGG AAA AGT TTT ATC CAA AAA CTC CAT GAA GAC 798
Asn Phe Asp Phe Tyr Arg Lys Ser Phe Ile Gln Lys Leu His Glu Asp
205 210 215

CAC CAG AAA AAA ATG GGC CTA ATC GTG GTA GGC AGA GAG CTT TTT TTA 846
His Gln Lys Lys Met Gly Leu Ile Val Val Gly Arg Glu Leu Phe Leu
220 225 230

TCT AAA AAA CAC CGA AAG GCC TTG TAT AAA ACA GCC ACC CCA GTT TAT 894
Ser Lys Lys His Arg Lys Ala Leu Tyr Lys Thr Ala Thr Pro Val Tyr
235 240 245

AAA ACC AAC ACT TCT GGC TTG TCT AAA ACC TCT CAA AGC GTG GTG GTA 942
Lys Thr Asn Thr Ser Gly Leu Ser Lys Thr Ser Gln Ser Val Val Val
250 255 260

TTG AAT GAA AGT TTG GAT ATT AAT GAG GAC ATG TCT TCA GTG ATT TTT 990
Leu Asn Glu Ser Leu Asp Ile Asn Glu Asp Met Ser Ser Val Ile Phe
265 270 275 280

GAT GTG TCT ATG CAA ATG GAT TTG GGC TTG TTG CTC TAT GAT TTT GAC 1038
Asp Val Ser Met Gln Met Asp Leu Gly Leu Leu Leu Tyr Asp Phe Asp
285 290 295

CCT AAC AAG CGC TAT AAA AAC GAG ATT GTC AAT CAT TAT GAA AAT TTA 1086
Pro Asn Lys Arg Tyr Lys Asn Glu Ile Val Asn His Tyr Glu Asn Leu
300 305 310

GCC AAC GCG TTC AAC CGC AAG ATT GAG ATT TTC CAA ACC GAT ATT AGA 1134
Ala Asn Ala Phe Asn Arg Lys Ile Glu Ile Phe Gln Thr Asp Ile Arg
315 320 325

AAT CCT ATC ATG TAT CTC AAT TCT TTA AGA AAT CCC ATT TTG CAT TTC 1182
Asn Pro Ile Met Tyr Leu Asn Ser Leu Arg Asn Pro Ile Leu His Phe
330 335 340

ATG CCT TTT GAA GAG TGC ATC ACG CAC ACG CGC TTT TGG TGG TTT TTA 1230
Met Pro Phe Glu Glu Cys Ile Thr His Thr Arg Phe Trp Trp Phe Leu
345 350 355 360

TCC ACT AAA GTG GAA AAA TTA GCG TTT TTA AAC GAT GAT AAC CCT CAA 1278
Ser Thr Lys Val Glu Lys Leu Ala Phe Leu Asn Asp Asp Asn Pro Gln
365 370 375

ATT TTT ATC CCT GTA GCG GAG TGAAAGAATG CAAGAAATTT TAATCCCTTT AAAA 1333
Ile Phe Ile Pro Val Ala Glu
380

GAAAAAAACT ATAAAGTGTT TTTGGGGGAA CTGCCTGAAA TAAAATTGAA ACAAAAAGCC 1393
CTCATCATTA GCGATAGCAT CGTAGCCGGG TTGCATTTGC CCTATTT 1440

(2) INFORMATION FOR SEQ ID NO: 88:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 383 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:

Met Cys Val Val Leu Ser Val Lys Arg Asp Gly Glu Lys Thr Leu Glu
1 5 10 15

Asn Asn Glu Glu Asn Lys Asp Glu Lys Leu Ile Leu Ile Asp Glu Phe
20 25 30

Glu Val Leu Ala Asn Lys Phe Ile Ser Arg Leu Pro Asn Ile Pro Ser
35 40 45

Thr Pro Arg Glu Phe Gly Leu Gly Lys Gly Glu Ile Met Glu Ile Asp
50 55 60

Val Pro Phe Gly Ser Ile Phe Ala Tyr Arg His Ile Gly Ser Ile Arg
65 70 75 80

Gln Lys Glu Tyr Arg Ile Val Gly Leu Tyr Arg Asn Asp Val Leu Leu
85 90 95

Leu Ser Thr Lys Ser Leu Val Ile Gln Pro Arg Asp Ile Leu Leu Val
100 105 110

Ala Gly Asn Pro Glu Ile Leu Asn Ala Val Tyr Leu Gln Val Lys Ser
115 120 125

Asn Val Gly Gln Phe Pro Ala Pro Phe Gly Lys Ser Ile Tyr Leu Tyr
130 135 140

Ile Asp Met Arg Leu Gln Asn Arg Lys Ala Met Met Arg Asp Val Tyr
145 150 155 160

Gln Ala Leu Phe Leu His Lys His Leu Lys Ser Tyr Lys Leu Tyr Ile
165 170 175

Gln Val Leu His Pro Thr Ser Pro Lys Phe Tyr His Lys Phe Leu Ala
180 185 190

Leu Glu Thr Glu Ser Ile Glu Val Asn Phe Asp Phe Tyr Arg Lys Ser
195 200 205

Phe Ile Gln Lys Leu His Glu Asp His Gln Lys Lys Met Gly Leu Ile
210 215 220

Val Val Gly Arg Glu Leu Phe Leu Ser Lys Lys His Arg Lys Ala Leu
225 230 235 240

Tyr Lys Thr Ala Thr Pro Val Tyr Lys Thr Asn Thr Ser Gly Leu Ser
245 250 255

Lys Thr Ser Gln Ser Val Val Val Leu Asn Glu Ser Leu Asp Ile Asn
260 265 270

Glu Asp Met Ser Ser Val Ile Phe Asp Val Ser Met Gln Met Asp Leu
275 280 285

Gly Leu Leu Leu Tyr Asp Phe Asp Pro Asn Lys Arg Tyr Lys Asn Glu
290 295 300

Ile Val Asn His Tyr Glu Asn Leu Ala Asn Ala Phe Asn Arg Lys Ile
305 310 315 320

Glu Ile Phe Gln Thr Asp Ile Arg Asn Pro Ile Met Tyr Leu Asn Ser
325 330 335

Leu Arg Asn Pro Ile Leu His Phe Met Pro Phe Glu Glu Cys Ile Thr
340 345 350

His Thr Arg Phe Trp Trp Phe Leu Ser Thr Lys Val Glu Lys Leu Ala
355 360 365

Phe Leu Asn Asp Asp Asn Pro Gln Ile Phe Ile Pro Val Ala Glu
370 375 380

(2) INFORMATION FOR SEQ ID NO: 89:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 517 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 51...464
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

AGATTTCATT CGAGGTAGAA AATACATTGA AAAAGCGTGT GAATTAAACG ATG GTA 56
Met Val
1

GGG GGT GGA ACG GTA AAA AAA GAC TTG AAG AAA GCC ATT CAA TAC TAT 104
Gly Gly Gly Thr Val Lys Lys Asp Leu Lys Lys Ala Ile Gln Tyr Tyr
5 10 15

GTT AAA GCG TGT GAA TTG AAT GAA ATG TTT GGG TGT CTG TCA TTA GTT 152
Val Lys Ala Cys Glu Leu Asn Glu Met Phe Gly Cys Leu Ser Leu Val
20 25 30

TCG AAC TCT CAA ATA AAC AAA CAA AAA CTC TTT CAA TAT CTC TCT AAA 200
Ser Asn Ser Gln Ile Asn Lys Gln Lys Leu Phe Gln Tyr Leu Ser Lys
35 40 45 50

GCT TGT GAA TTA AAT AGT GGT AAT GGA TGT AGG TTT TTA GGG GAT TTT 248
Ala Cys Glu Leu Asn Ser Gly Asn Gly Cys Arg Phe Leu Gly Asp Phe
55 60 65

TAT GAG AAT GGA AAA TAT GTA AAA AAG GAT TTA AGA AAA GCT GCT CAA 296
Tyr Glu Asn Gly Lys Tyr Val Lys Lys Asp Leu Arg Lys Ala Ala Gln
70 75 80

TAC TAC TCT AAA GCT TGT GGA TTA AAT GAT CAA GAT GGG TGT TTA ATA 344
Tyr Tyr Ser Lys Ala Cys Gly Leu Asn Asp Gln Asp Gly Cys Leu Ile
85 90 95

CTA GGA TAT AAG CAA TAT GCT GGC AAG GGC GTA GTC AAA AAT GAA AAA 392
Leu Gly Tyr Lys Gln Tyr Ala Gly Lys Gly Val Val Lys Asn Glu Lys
100 105 110

CAA GCG GTG AAA ACC TTT GAA AAG GCT TGT AGG TTA GGA TCT GAA GAC 440
Gln Ala Val Lys Thr Phe Glu Lys Ala Cys Arg Leu Gly Ser Glu Asp
115 120 125 130

GCA TGT GGT ATT TTA AAC AAC TAC TAGATTTGAA ATAAATGCTG TTTTTTAGCT 494
Ala Cys Gly Ile Leu Asn Asn Tyr
135

GGCTTTCATG TTTTTGTAAC CCC 517

(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:

Met Val Gly Gly Gly Thr Val Lys Lys Asp Leu Lys Lys Ala Ile Gln
1 5 10 15

Tyr Tyr Val Lys Ala Cys Glu Leu Asn Glu Met Phe Gly Cys Leu Ser
20 25 30

Leu Val Ser Asn Ser Gln Ile Asn Lys Gln Lys Leu Phe Gln Tyr Leu
35 40 45

Ser Lys Ala Cys Glu Leu Asn Ser Gly Asn Gly Cys Arg Phe Leu Gly
50 55 60

Asp Phe Tyr Glu Asn Gly Lys Tyr Val Lys Lys Asp Leu Arg Lys Ala
65 70 75 80

Ala Gln Tyr Tyr Ser Lys Ala Cys Gly Leu Asn Asp Gln Asp Gly Cys
85 90 95

Leu Ile Leu Gly Tyr Lys Gln Tyr Ala Gly Lys Gly Val Val Lys Asn
100 105 110

Glu Lys Gln Ala Val Lys Thr Phe Glu Lys Ala Cys Arg Leu Gly Ser
115 120 125

Glu Asp Ala Cys Gly Ile Leu Asn Asn Tyr
130 135

(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1663 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 68...1600
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

AAATGTTAGA AACCCTTACA AAACAAGCTA ATATATTCTA TTCAATTTGC CTCAAGGACA 60

AACAAAC ATG AAA AAA CTT CTT TAT ACC ATA CTC GCG CTT CTT TTA ATC 109
Met Lys Lys Leu Leu Tyr Thr Ile Leu Ala Leu Leu Leu Ile
1 5 10

GGC CTT TTA ACA ATC TAT CTC ATC CTT TTT ACA GAA TGG GGG AAT AAG 157
Gly Leu Leu Thr Ile Tyr Leu Ile Leu Phe Thr Glu Trp Gly Asn Lys
15 20 25 30

ATC ATC GCT TCG TAT ATA GAG AAA AAA ATC AAC CCG AAC GAG CAC TAC 205
Ile Ile Ala Ser Tyr Ile Glu Lys Lys Ile Asn Pro Asn Glu His Tyr
35 40 45

TTG AGC GTT AAA ACC TTT AAA TTG AGA TTC AAC TCT TTG GAT TTT AAA 253
Leu Ser Val Lys Thr Phe Lys Leu Arg Phe Asn Ser Leu Asp Phe Lys
50 55 60

GCT CAA GCC AAC GAT GAT TCC ACG CTC ATT CTT AAG GGG GAT TTT TCA 301
Ala Gln Ala Asn Asp Asp Ser Thr Leu Ile Leu Lys Gly Asp Phe Ser
65 70 75

CTT TTA AAG CAA AGC GTA AAT TTG AAT TAC CAT ATA GAT ATT AAA GAT 349
Leu Leu Lys Gln Ser Val Asn Leu Asn Tyr His Ile Asp Ile Lys Asp
80 85 90

TTA CGC TCT TTC AAA GAA TGG ATA CCC TAC CCT TTA AGG GGG GCT GTT 397
Leu Arg Ser Phe Lys Glu Trp Ile Pro Tyr Pro Leu Arg Gly Ala Val
95 100 105 110

ATC ACT TCT GGG AAT ATT AAA GGG CAT AGA AAA GCC CTT ATG ATT CAA 445
Ile Thr Ser Gly Asn Ile Lys Gly His Arg Lys Ala Leu Met Ile Gln
115 120 125

GGC GTC TCT AAT GTG GCT CAA TCC CAC ACT GCC TAC AAT GCC CTT TTA 493
Gly Val Ser Asn Val Ala Gln Ser His Thr Ala Tyr Asn Ala Leu Leu
130 135 140

GAT GAT TTC AAG CTT TCT CGC TTA AAT TTG AAC GCA CAA GAC GCC AAT 541
Asp Asp Phe Lys Leu Ser Arg Leu Asn Leu Asn Ala Gln Asp Ala Asn
145 150 155

TTA GAA GAT TTG CTT TAT TTA ATC AAT CGC CCC GCT TAT GCG AAC GCA 589
Leu Glu Asp Leu Leu Tyr Leu Ile Asn Arg Pro Ala Tyr Ala Asn Ala
160 165 170

AAA GTG TCC TTA CAG GCG GAT TTT AAC TCT CTA AAG CCT TTA GAG GGG 637
Lys Val Ser Leu Gln Ala Asp Phe Asn Ser Leu Lys Pro Leu Glu Gly
175 180 185 190

CAT TTG ATC CTA ACA GCT AAT AAC GCT TTA ATC AAT AAC GCC CTA ATC 685
His Leu Ile Leu Thr Ala Asn Asn Ala Leu Ile Asn Asn Ala Leu Ile
195 200 205

AAT CAA ATT TTT CAT TTA AAC CTT AAA GAC ACG CTT GTT TTC AGC CTC 733
Asn Gln Ile Phe His Leu Asn Leu Lys Asp Thr Leu Val Phe Ser Leu
210 215 220

TCG CAT TCA AGC GAC TTT AAA GGA AAC AAA GCC ATC AGC GAT ACC ACC 781
Ser His Ser Ser Asp Phe Lys Gly Asn Lys Ala Ile Ser Asp Thr Thr
225 230 235

CTG ACT AGC CCT TTA GCC AAT TTC AAA GCC CTA AAA AGC GAA TAC CTT 829
Leu Thr Ser Pro Leu Ala Asn Phe Lys Ala Leu Lys Ser Glu Tyr Leu
240 245 250

TTC TCT ATT TTA AAA CTC AAC GCC CCC TAC ACT TTA GAA ATC CCC AAT 877
Phe Ser Ile Leu Lys Leu Asn Ala Pro Tyr Thr Leu Glu Ile Pro Asn
255 260 265 270

CTA GCC AAA CTC TAT AAC ATT ACC AAC CAC CCC TTA AAA GGG AGC TTG 925
Leu Ala Lys Leu Tyr Asn Ile Thr Asn His Pro Leu Lys Gly Ser Leu
275 280 285

ACT TTA AAA GGC GCT ATA GAA CAA AGC CCC AAA CTT TTA AAA GTC AGC 973
Thr Leu Lys Gly Ala Ile Glu Gln Ser Pro Lys Leu Leu Lys Val Ser
290 295 300

GGC CAT TCA AAT TTA CTA GAC GGC GCG CTG GAT TTC ACG CTT TTA AAT 1021
Gly His Ser Asn Leu Leu Asp Gly Ala Leu Asp Phe Thr Leu Leu Asn
305 310 315

AAA GAT TTG AAA GGG CGT TTT TCC AAT ATT TCC ACT TTA AAA GCT TTA 1069
Lys Asp Leu Lys Gly Arg Phe Ser Asn Ile Ser Thr Leu Lys Ala Leu
320 325 330

GAT TTA TTC CAT TAC CCT AAG TTT TTC CAA TCC GTT GCA GAC GCT AAT 1117
Asp Leu Phe His Tyr Pro Lys Phe Phe Gln Ser Val Ala Asp Ala Asn
335 340 345 350

TTG GAT TAT GAT CTT ATC GCT AAG CAA GGC GTA TTG AAA GCC CGC CTA 1165
Leu Asp Tyr Asp Leu Ile Ala Lys Gln Gly Val Leu Lys Ala Arg Leu
355 360 365

AAA AAC GCA AGA TTC CTC AAA AAT GCA TTC AGC GAT TTT CTC TAC TCC 1213
Lys Asn Ala Arg Phe Leu Lys Asn Ala Phe Ser Asp Phe Leu Tyr Ser
370 375 380

ATT TCT AAA TTT GAT ATT ACA AAA GAA ATT TAT AAC GAT GCC AAT CTG 1261
Ile Ser Lys Phe Asp Ile Thr Lys Glu Ile Tyr Asn Asp Ala Asn Leu
385 390 395

GTA AGC CAA ATC AAC CAG CAA CGC CTG CTC TCT GAT CTG AGT TTA AAA 1309
Val Ser Gln Ile Asn Gln Gln Arg Leu Leu Ser Asp Leu Ser Leu Lys
400 405 410

AGC CCC AAA ACC CAA TTG AAA ATC CAT AAC GGT TTG TTG GAT TTA AAC 1357
Ser Pro Lys Thr Gln Leu Lys Ile His Asn Gly Leu Leu Asp Leu Asn
415 420 425 430

ACC AAA CAA ATG AAC ATG CTC ATG GAT GCG GAA ATT TTA AAA TTC ATT 1405
Thr Lys Gln Met Asn Met Leu Met Asp Ala Glu Ile Leu Lys Phe Ile
435 440 445

TTT AAA ATG AAA CTT CAA GGC AAC ATG CAC CAG CCA AAA TTT TCT CTC 1453
Phe Lys Met Lys Leu Gln Gly Asn Met His Gln Pro Lys Phe Ser Leu
450 455 460

ATT TTA AAC GAA AAA GCC ATT CAG CAA AAC TTG CAA CAA GGC TTG AAA 1501
Ile Leu Asn Glu Lys Ala Ile Gln Gln Asn Leu Gln Gln Gly Leu Lys
465 470 475

GAA ATC TTA AAA AAC GAC ACC CTT AAA AAA GGT TTA GAT CAT TTG CTT 1549
Glu Ile Leu Lys Asn Asp Thr Leu Lys Lys Gly Leu Asp His Leu Leu
480 485 490

AAA GAT GAT AAG CTC AAA GAA AAG CTT GAA AAA GGG CTT AAG GGG CTT 1597
Lys Asp Asp Lys Leu Lys Glu Lys Leu Glu Lys Gly Leu Lys Gly Leu
495 500 505 510

TTT TAAAAATTTT AAAGGATAGA AATGGCGCAC ATTTTAGTTA GCGGGGCGAC TTCAGG 1656
Phe

GTTTGGA 1663

(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 511 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

Met Lys Lys Leu Leu Tyr Thr Ile Leu Ala Leu Leu Leu Ile Gly Leu
1 5 10 15

Leu Thr Ile Tyr Leu Ile Leu Phe Thr Glu Trp Gly Asn Lys Ile Ile
20 25 30

Ala Ser Tyr Ile Glu Lys Lys Ile Asn Pro Asn Glu His Tyr Leu Ser
35 40 45

Val Lys Thr Phe Lys Leu Arg Phe Asn Ser Leu Asp Phe Lys Ala Gln
50 55 60

Ala Asn Asp Asp Ser Thr Leu Ile Leu Lys Gly Asp Phe Ser Leu Leu
65 70 75 80

Lys Gln Ser Val Asn Leu Asn Tyr His Ile Asp Ile Lys Asp Leu Arg
85 90 95

Ser Phe Lys Glu Trp Ile Pro Tyr Pro Leu Arg Gly Ala Val Ile Thr
100 105 110

Ser Gly Asn Ile Lys Gly His Arg Lys Ala Leu Met Ile Gln Gly Val
115 120 125

Ser Asn Val Ala Gln Ser His Thr Ala Tyr Asn Ala Leu Leu Asp Asp
130 135 140

Phe Lys Leu Ser Arg Leu Asn Leu Asn Ala Gln Asp Ala Asn Leu Glu
145 150 155 160

Asp Leu Leu Tyr Leu Ile Asn Arg Pro Ala Tyr Ala Asn Ala Lys Val
165 170 175

Ser Leu Gln Ala Asp Phe Asn Ser Leu Lys Pro Leu Glu Gly His Leu
180 185 190

Ile Leu Thr Ala Asn Asn Ala Leu Ile Asn Asn Ala Leu Ile Asn Gln
195 200 205

Ile Phe His Leu Asn Leu Lys Asp Thr Leu Val Phe Ser Leu Ser His
210 215 220

Ser Ser Asp Phe Lys Gly Asn Lys Ala Ile Ser Asp Thr Thr Leu Thr
225 230 235 240

Ser Pro Leu Ala Asn Phe Lys Ala Leu Lys Ser Glu Tyr Leu Phe Ser
245 250 255

Ile Leu Lys Leu Asn Ala Pro Tyr Thr Leu Glu Ile Pro Asn Leu Ala
260 265 270

Lys Leu Tyr Asn Ile Thr Asn His Pro Leu Lys Gly Ser Leu Thr Leu
275 280 285

Lys Gly Ala Ile Glu Gln Ser Pro Lys Leu Leu Lys Val Ser Gly His
290 295 300

Ser Asn Leu Leu Asp Gly Ala Leu Asp Phe Thr Leu Leu Asn Lys Asp
305 310 315 320

Leu Lys Gly Arg Phe Ser Asn Ile Ser Thr Leu Lys Ala Leu Asp Leu
325 330 335

Phe His Tyr Pro Lys Phe Phe Gln Ser Val Ala Asp Ala Asn Leu Asp
340 345 350

Tyr Asp Leu Ile Ala Lys Gln Gly Val Leu Lys Ala Arg Leu Lys Asn
355 360 365

Ala Arg Phe Leu Lys Asn Ala Phe Ser Asp Phe Leu Tyr Ser Ile Ser
370 375 380

Lys Phe Asp Ile Thr Lys Glu Ile Tyr Asn Asp Ala Asn Leu Val Ser
385 390 395 400

Gln Ile Asn Gln Gln Arg Leu Leu Ser Asp Leu Ser Leu Lys Ser Pro
405 410 415

Lys Thr Gln Leu Lys Ile His Asn Gly Leu Leu Asp Leu Asn Thr Lys
420 425 430

Gln Met Asn Met Leu Met Asp Ala Glu Ile Leu Lys Phe Ile Phe Lys
435 440 445

Met Lys Leu Gln Gly Asn Met His Gln Pro Lys Phe Ser Leu Ile Leu
450 455 460

Asn Glu Lys Ala Ile Gln Gln Asn Leu Gln Gln Gly Leu Lys Glu Ile
465 470 475 480

Leu Lys Asn Asp Thr Leu Lys Lys Gly Leu Asp His Leu Leu Lys Asp
485 490 495

Asp Lys Leu Lys Glu Lys Leu Glu Lys Gly Leu Lys Gly Leu Phe
500 505 510

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 947 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 292...645
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

AGTGCATAAA CGCACAGACC CCAAAAATGA AAGCTATTTT TGGCTAGGGC TACACCCTTT 60
AGAATGGCAA AAGCGCGAAA ATGAAGACAG ACTCTCTGAT TTTGACGCTA TTGCTTCAAA 120
CCATGCCTCT ATCACGCCTT TAAATTTAGA CTTAACCAGT TATGATGATT TGAAAAGTTT 180
GGAATCTTGG CATGAGGGAA TGTTAAAGTG AGTAAAAAGC ACCGCTTGGC TTTTTTAGGG 240

CTAATTGTTG GGGTTCTATT CTTCTTTAGT GCGTGTGAGC ACCGCCTGCA C ATG GGG 297
Met Gly
1

TAT TAT TCA GAA GTT ACA GGG GAT TAT TTG TTC AAT TAT AAT TCC ACT 345
Tyr Tyr Ser Glu Val Thr Gly Asp Tyr Leu Phe Asn Tyr Asn Ser Thr
5 10 15

ATC GTG GTG GCT TAT GAC AGA AGC GAT GCG ATG ACT TCT TAT TAT ATC 393
Ile Val Val Ala Tyr Asp Arg Ser Asp Ala Met Thr Ser Tyr Tyr Ile
20 25 30

AAT GTG ATT GTT TAT GAA TTG CAA AAA TTA GGC TTT TAC AAT GTC TTC 441
Asn Val Ile Val Tyr Glu Leu Gln Lys Leu Gly Phe Tyr Asn Val Phe
35 40 45 50

ACG CAA GCG GAA TTC CCA CTA GAT AAA GCC AAA AAT GTG ATC TAT GCG 489
Thr Gln Ala Glu Phe Pro Leu Asp Lys Ala Lys Asn Val Ile Tyr Ala
55 60 65

CGC ATT GTC CGT AAC ATC TCA GCT GTG CCG TTC TAC CAA TAC AAT TAC 537
Arg Ile Val Arg Asn Ile Ser Ala Val Pro Phe Tyr Gln Tyr Asn Tyr
70 75 80

CAA CTG ATT GAT CAA GTC AAT AAG CCT TGT TAT TTT CTT GGG GGG CAG 585
Gln Leu Ile Asp Gln Val Asn Lys Pro Cys Tyr Phe Leu Gly Gly Gln
85 90 95

TTT TAT TGC TCT CAA ACC CTA CGG ATT ATT ACG CTA TCA ATG GCT TTA 633
Phe Tyr Cys Ser Gln Thr Leu Arg Ile Ile Thr Leu Ser Met Ala Leu
100 105 110

GCG AGC AAA TTT TAATGAGTGC TAATTCGCAT TTTATTTTAG ATTGGTATGA TGTGG 690
Ala Ser Lys Phe
115

TGTTGCAAAA ACGGGTTTTA TATGTGGATG GGAGCGTGAG CGGGAGGACT TGCGGCTATC 750
AGATGCTGTA TAGGGATTTG ATTAAAAGCA CGATCAAACG CATTGATTTT AACCGCCCTG 810
AACGCTACTA CTACAATTTA AGACTGCCCC TTTATCAGCC ATGTTATAGG CAATGAAATG 870
GTTATCAGGC GATTGTATCA ATTTTGCGCT AGCCATGTGG TGCGCAATTG CTCTTCTTTA 930
AAATGCGCTC AAAATAT 947

(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 118 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

Met Gly Tyr Tyr Ser Glu Val Thr Gly Asp Tyr Leu Phe Asn Tyr Asn
1 5 10 15

Ser Thr Ile Val Val Ala Tyr Asp Arg Ser Asp Ala Met Thr Ser Tyr
20 25 30

Tyr Ile Asn Val Ile Val Tyr Glu Leu Gln Lys Leu Gly Phe Tyr Asn
35 40 45

Val Phe Thr Gln Ala Glu Phe Pro Leu Asp Lys Ala Lys Asn Val Ile
50 55 60

Tyr Ala Arg Ile Val Arg Asn Ile Ser Ala Val Pro Phe Tyr Gln Tyr
65 70 75 80

Asn Tyr Gln Leu Ile Asp Gln Val Asn Lys Pro Cys Tyr Phe Leu Gly
85 90 95

Gly Gln Phe Tyr Cys Ser Gln Thr Leu Arg Ile Ile Thr Leu Ser Met
100 105 110

Ala Leu Ala Ser Lys Phe
115



(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 875 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 348...716
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

TGCGGAGGGA ATGTCTATGA TAAAATCTCA GAAAAATTTG TAGAAAAAGT GGATAACGGG 60
TTTTGAAAAT TTTAATCCTT TTTTTTATCT GTTTAAACGC ATTGTTCGCC CTAGATTCAA 120
ACGCACTTAA AGCAGAGATT AAAGAAGTTT ACCTTAAAGA ATACAAAGAC TTAAAATTAG 180
AAATTGAAAC CATTAACTTA GAAATCCCAG AGCGCTTTTC TAACGCTTCC ATTTTAAGCT 240
ATGAATTAAA CGCTTCCAAT AAGCTTAAAA AAGATGGGGT CGTGTTTTTA AGGTTGGAAA 300

ATGATCCTAA TTTACGCCTA CCGGTGCGTT ATAGCGTGAT AGGCAGC ATG CAG GCT 356
Met Gln Ala
1

TTT AAA AGC GTT AGC GCG ATT AAA AAA GAT GAA AAC ATC ACC GCT AAT 404
Phe Lys Ser Val Ser Ala Ile Lys Lys Asp Glu Asn Ile Thr Ala Asn
5 10 15

AAC ACT CAA AAA GAG CGC ATT TTG TTT GGT GCG CTT TCT AAC CCC TTA 452
Asn Thr Gln Lys Glu Arg Ile Leu Phe Gly Ala Leu Ser Asn Pro Leu
20 25 30 35

TTA GAG GGC GCG ATT GAT AAA GTG AGC GCG AAA AAT TTT ATC CCC CCT 500
Leu Glu Gly Ala Ile Asp Lys Val Ser Ala Lys Asn Phe Ile Pro Pro
40 45 50

AAC ACG CTT TTA AGC ACG GAT AAA ACC CAA GCT TTA ATT ATC GTG CGT 548
Asn Thr Leu Leu Ser Thr Asp Lys Thr Gln Ala Leu Ile Ile Val Arg
55 60 65

AAA AAT GAC ATT ATC ACC GGG GTG TAT GAA GAG GGG CAA ATC AGC ATA 596
Lys Asn Asp Ile Ile Thr Gly Val Tyr Glu Glu Gly Gln Ile Ser Ile
70 75 80

GAA ATA AGC CTA AAA GCC CTA GAA AAT GGC GCG CTT AAT CAA ATC ATT 644
Glu Ile Ser Leu Lys Ala Leu Glu Asn Gly Ala Leu Asn Gln Ile Ile
85 90 95

CAA GCG AAA AAT TTA GAA AGC AAT AAA ATA CTC AAA GCA AAA GTG TTG 692
Gln Ala Lys Asn Leu Glu Ser Asn Lys Ile Leu Lys Ala Lys Val Leu
100 105 110 115

AGC AGC TCT AAA GCG CAA ATC TTA TAAAGGACAT TCATGAAATT GGTTTTAGGC 746
Ser Ser Ser Lys Ala Gln Ile Leu
120

ATCAGTGGAG CGAGCGGGAT ACCCCTAGCC TTGCGGTTTT TAGAAAAATT ACCCAAAGAA 806
ATTGAAGTTT TTGTCGTGGC GTCTAAAAAC GCGCATGTCG TGGCGTTAGA AGAATCTAAT 866
ATTAACCTT 875

(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 123 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

Met Gln Ala Phe Lys Ser Val Ser Ala Ile Lys Lys Asp Glu Asn Ile
1 5 10 15

Thr Ala Asn Asn Thr Gln Lys Glu Arg Ile Leu Phe Gly Ala Leu Ser
20 25 30

Asn Pro Leu Leu Glu Gly Ala Ile Asp Lys Val Ser Ala Lys Asn Phe
35 40 45

Ile Pro Pro Asn Thr Leu Leu Ser Thr Asp Lys Thr Gln Ala Leu Ile
50 55 60

Ile Val Arg Lys Asn Asp Ile Ile Thr Gly Val Tyr Glu Glu Gly Gln
65 70 75 80

Ile Ser Ile Glu Ile Ser Leu Lys Ala Leu Glu Asn Gly Ala Leu Asn
85 90 95

Gln Ile Ile Gln Ala Lys Asn Leu Glu Ser Asn Lys Ile Leu Lys Ala
100 105 110

Lys Val Leu Ser Ser Ser Lys Ala Gln Ile Leu
115 120

(2) INFORMATION FOR SEQ ID NO: 97:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 394 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 160...345
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

GGCATCACTT TTAACATGAC CCCTTCTCCA GGCGCGACGA GTTGTTTGCA AAACGCCCTT 60
GTGGATTCCC AAGAAATCGC TGCGTATTTG GGCGAGAGCT TTGAATTAGA ACGCTTTTAT 120

AAAGATTTAT CCCCAGAAGA ATTGGAAAAT TAAAAACGC ATG CAA AAA GAA CAA 174
Met Gln Lys Glu Gln
1 5

GAA GCC CAA GAA ATC GCT AAA AAA GCC GTT AAA ATC GTG TTT TTT TTA 222
Glu Ala Gln Glu Ile Ala Lys Lys Ala Val Lys Ile Val Phe Phe Leu
10 15 20

GGG CTT GTG GTG GTG CTT TTG ATG ATG ATA AAC CTT TAC ATG CTC ATC 270
Gly Leu Val Val Val Leu Leu Met Met Ile Asn Leu Tyr Met Leu Ile
25 30 35

AAT CAA ATC AAC GCG AGC GCT CAA ATG AGC CAC CAA ATC AAA AAG ATA 318
Asn Gln Ile Asn Ala Ser Ala Gln Met Ser His Gln Ile Lys Lys Ile
40 45 50

GAA GAA AGG CTT AAT CAG GAG CAA AAA TAAAAAAGGC TTTTTGGTAT TTTTACG 372
Glu Glu Arg Leu Asn Gln Glu Gln Lys
55 60

ATCAAATAGT AAAGAGCTTA TC 394

(2) INFORMATION FOR SEQ ID NO: 98:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 62 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

Met Gln Lys Glu Gln Glu Ala Gln Glu Ile Ala Lys Lys Ala Val Lys
1 5 10 15

Ile Val Phe Phe Leu Gly Leu Val Val Val Leu Leu Met Met Ile Asn
20 25 30

Leu Tyr Met Leu Ile Asn Gln Ile Asn Ala Ser Ala Gln Met Ser His
35 40 45

Gln Ile Lys Lys Ile Glu Glu Arg Leu Asn Gln Glu Gln Lys
50 55 60

(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 982 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:
(A) NAME/KEY: Coding Sequence
(B) LOCATION: 320...880
(D) OTHER INFORMATION:

(A) NAME/KEY: sig_peptide
(B) LOCATION: 320...400
(D) OTHER INFORMATION:

(A) NAME/KEY: mat_peptide
(B) LOCATION: 401...880
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:

AGATTAGCAG CAGCAGGGAT TTTTAAATTC CTGGCCAACA GGGGCGGTTG GAAAAAAATA 60
CGATTAAAAA GGCAAACGCT TTGAAAGTAT TTTTTCATAG AAATTCCCTT TTGTTAAATG 120
ATTGAAGTTG GTGATTATAC CTATTTGTAT CTTAAAAATT TGATTTTAAA AGTTTGAGAT 180
GGTTTTGTAG GTGTATCCCA CTTATCCAAT TTATATCAAT ATTTTCACTC TAAAACCCTC 240
ATCCTTGATA AAAAATTAAA CCTTTTAGAA AAATAACCGA TTTTAGGGTG TAACTTTAAT 300

TCAACAAGAA GGATTTATT ATG ATT AAA AGA ATT GCT TGT ATT TTA AGC TTG 352
Met Ile Lys Arg Ile Ala Cys Ile Leu Ser Leu
-27 -25 -20

AGT GCG AGT TTA GCG CTG GCT GGC GAA GTG AAT GGG TTT TTC ATG GGT 400
Ser Ala Ser Leu Ala Leu Ala Gly Glu Val Asn Gly Phe Phe Met Gly
-15 -10 -5

GCG GGT TAT CAG CAA GGT CGT TAT GGT CCT TAT AAC AGC AAT TAC TCT 448
Ala Gly Tyr Gln Gln Gly Arg Tyr Gly Pro Tyr Asn Ser Asn Tyr Ser
1 5 10 15

GAT TGG CGC CAT GGC AAT GAT CTT TAT GGT TTG AAT TTC AAA TTA GGT 496
Asp Trp Arg His Gly Asn Asp Leu Tyr Gly Leu Asn Phe Lys Leu Gly
20 25 30

TTT GTA GGC TTT GCC AAT AAA TGG TTT GGG GCT AGG GTG TAT GGC TTT 544
Phe Val Gly Phe Ala Asn Lys Trp Phe Gly Ala Arg Val Tyr Gly Phe
35 40 45

TTA GAT TGG TTT AAC ACT TCA GGG ACA GAA CAC ACC AAA ACC AAT TTG 592
Leu Asp Trp Phe Asn Thr Ser Gly Thr Glu His Thr Lys Thr Asn Leu
50 55 60

CTC ACC TAT GGT GGC GGT GGC GAT TTG ATT GTC AAT CTC ATT CCT TTG 640
Leu Thr Tyr Gly Gly Gly Gly Asp Leu Ile Val Asn Leu Ile Pro Leu
65 70 75 80

GAT AAA TTC GCT CTA GGT CTC ATC GGT GGC GTT CAA TTA GCC GGA AAC 688
Asp Lys Phe Ala Leu Gly Leu Ile Gly Gly Val Gln Leu Ala Gly Asn
85 90 95

ACT TGG ATG TTC CCT TAT GAT GTC AAT CAA ACG AGA TTC CAG TTC TTA 736
Thr Trp Met Phe Pro Tyr Asp Val Asn Gln Thr Arg Phe Gln Phe Leu
100 105 110

TGG AAT TTA GGC GGA AGA ATG CGT GTT GGG GAT CGC AGT GCG TTT GAA 784
Trp Asn Leu Gly Gly Arg Met Arg Val Gly Asp Arg Ser Ala Phe Glu
115 120 125

GCA GGC GTG AAA TTC CCT ATG GTT AAT CAA GGC AAC AAA GAT GTT AGG 832
Ala Gly Val Lys Phe Pro Met Val Asn Gln Gly Asn Lys Asp Val Arg
130 135 140

GCT TAT CCG CTA CTA TTC TTG GGT ATG TGG ATT ATG TTC TTC ACT TTC T 881
Ala Tyr Pro Leu Leu Phe Leu Gly Met Trp Ile Met Phe Phe Thr Phe
145 150 155 160

AATTTATTCC TTTCATTCGC TCTTCTTCAT CAAATCAACC CTAACCCACT CTTAAAAGGT 941
TGGGGTTCAA AAATCTTTTT CATAAATAAA ATTTGCCTTA A 982

(2) INFORMATION FOR SEQ ID NO:100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:

Met Ile Lys Arg Ile Ala Cys Ile Leu Ser Leu Ser Ala Ser Leu Ala

-27 -25 -20 -15

Leu Ala Gly Glu Val Asn Gly Phe Phe Met Gly Ala Gly Tyr Gln Gln

-10 -5 1 5

Gly Arg Tyr Gly Pro Tyr Asn Ser Asn Tyr Ser Asp Trp Arg His Gly

10 15 20

Asn Asp Leu Tyr Gly Leu Asn Phe Lys Leu Gly Phe Val Gly Phe Ala

25 30 35

Asn Lys Trp Phe Gly Ala Arg Val Tyr Gly Phe Leu Asp Trp Phe Asn

40 45 50

Thr Ser Gly Thr Glu His Thr Lys Thr Asn Leu Leu Thr Tyr Gly Gly
55 60 65

Gly Gly Asp Leu Ile Val Asn Leu Ile Pro Leu Asp Lys Phe Ala Leu
70 75 80 85

Gly Leu Ile Gly Gly Val Gln Leu Ala Gly Asn Thr Trp Met Phe Pro

90 95 100

Tyr Asp Val Asn Gln Thr Arg Phe Gln Phe Leu Trp Asn Leu Gly Gly

105 110 115

Arg Met Arg Val Gly Asp Arg Ser Ala Phe Glu Ala Gly Val Lys Phe

120 125 130

Pro Met Val Asn Gln Gly Asn Lys Asp Val Arg Ala Tyr Pro Leu Leu

135 140 145

Phe Leu Gly Met Trp Ile Met Phe Phe Thr Phe
150 155 160 
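
The feature coordinates listed for SEQ ID NO: 99 can be cross-checked against the residue numbering of SEQ ID NO: 100 with simple arithmetic. The following is a minimal sketch and not part of the original listing; the names FEATURES and codon_count are illustrative assumptions, and each span is assumed to be 1-based and inclusive.

```python
# Feature spans as printed for SEQ ID NO: 99 (assumed 1-based, inclusive).
FEATURES = {
    "Coding Sequence": (320, 880),
    "sig_peptide": (320, 400),
    "mat_peptide": (401, 880),
}


def codon_count(start, end):
    """Return the number of whole codons in a 1-based inclusive span."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"span {start}..{end} is not a whole number of codons")
    return length // 3


# Number of codons (and therefore residues) covered by each feature.
counts = {name: codon_count(*span) for name, span in FEATURES.items()}
print(counts)
```

Under these assumptions the coding sequence covers 187 codons, of which 27 belong to the signal peptide and 160 to the mature peptide, which agrees with the stated length of 187 amino acids and the residue numbering from -27 to 160.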



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 262...777
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CCAATGGAGG CGTTTCCAAA AACCCAAACG GGCGCTTTTT AAAGAAAAAT CTCAAAAAAT 60

TCAGGGAGCA AGCGGTAAAA ATCGTAGAAA AACGCTTGAT AAAAGAGAAT ATGCAACTGA 120

GCGATTTTAA TGAAGAAGAA TTAAAAATCA TGTTTGAAGC TGAAGAAAAA AGGTTGTTAG 180

AGCAAATCCA CGCTAAAGAA TTGAAAGAAA AGCAAGAAAA AACCACCAAG CATTTTAAAG 240

AAGTTTGGGA AAAGGGCGAA A ATG AGC AAG AAA AAT AGC GTA ATT TCT GGT 291
Met Ser Lys Lys Asn Ser Val Ile Ser Gly
1 5 10

TTA ATG AAT TTT TTT AGC GAA AAG AAT GAA CGC TGG CTC TTA GCC CAC 339
Leu Met Asn Phe Phe Ser Glu Lys Asn Glu Arg Trp Leu Leu Ala His
15 20 25

AGG CAC ACG AGA GGG TTT GTG ATA GTG GCG TGG CTT TTT CGG TTT AAA 387
Arg His Thr Arg Gly Phe Val Ile Val Ala Trp Leu Phe Arg Phe Lys
30 35 40

AGC ATT GCG TTT TCT ATT TTG ATC ACT CTG TTG GTT ATT TTA GTG GAT 435
Ser Ile Ala Phe Ser Ile Leu Ile Thr Leu Leu Val Ile Leu Val Asp
45 50 55

ATT TGG GTT TAT AGC GAT GTG CGT CAG TTT TTA TTG GAC ACT TCT AGC 483
Ile Trp Val Tyr Ser Asp Val Arg Gln Phe Leu Leu Asp Thr Ser Ser
60 65 70

TCT TTT ATT TGG CTT TTG ATC GCT TTA CTA ATC AAG TGG GGC GTG ATT 531
Ser Phe Ile Trp Leu Leu Ile Ala Leu Leu Ile Lys Trp Gly Val Ile
75 80 85 90

GTC ATA AGC GCA CGT AAA TGC TAC CAA TTC AGC CAA AAA ATG TTT ACG 579
Val Ile Ser Ala Arg Lys Cys Tyr Gln Phe Ser Gln Lys Met Phe Thr
95 100 105

CTC ATT CAA AGA AAA AGG CAA ATC AGA GAG AAT TTA AAA AAC CGC TCC 627
Leu Ile Gln Arg Lys Arg Gln Ile Arg Glu Asn Leu Lys Asn Arg Ser
110 115 120

AAC TAC AAA GAT ACC AAA AAT GCG GAA AAA CTC TCT AGC ATC GCT GAA 675
Asn Tyr Lys Asp Thr Lys Asn Ala Glu Lys Leu Ser Ser Ile Ala Glu
125 130 135

GAA ATC ATT TCA AAA AAA CAA GAA GAG TCC CGC CCC AAA GAA GAT TCT 723
Glu Ile Ile Ser Lys Lys Gln Glu Glu Ser Arg Pro Lys Glu Asp Ser
140 145 150

AAT CAT GAA AAC CAT AAA GAA AAG CTT TCT AAC ATT ACC GAA GAA AGT 771
Asn His Glu Asn His Lys Glu Lys Leu Ser Asn Ile Thr Glu Glu Ser
155 160 165 170

GAT TCT TAAAAAACAA GAGGAATTGA AAAGCTAAAA AGGATAGGGG GGGATTACCC AA 829
Asp Ser

AGCATATTGG AGGG 843



(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:

Met Ser Lys Lys Asn Ser Val Ile Ser Gly Leu Met Asn Phe Phe Ser

1 5 10 15

Glu Lys Asn Glu Arg Trp Leu Leu Ala His Arg His Thr Arg Gly Phe

20 25 30

Val Ile Val Ala Trp Leu Phe Arg Phe Lys Ser Ile Ala Phe Ser Ile

35 40 45

Leu Ile Thr Leu Leu Val Ile Leu Val Asp Ile Trp Val Tyr Ser Asp

50 55 60

Val Arg Gln Phe Leu Leu Asp Thr Ser Ser Ser Phe Ile Trp Leu Leu
65 70 75 80

Ile Ala Leu Leu Ile Lys Trp Gly Val Ile Val Ile Ser Ala Arg Lys

85 90 95

Cys Tyr Gln Phe Ser Gln Lys Met Phe Thr Leu Ile Gln Arg Lys Arg

100 105 110

Gln Ile Arg Glu Asn Leu Lys Asn Arg Ser Asn Tyr Lys Asp Thr Lys

115 120 125

Asn Ala Glu Lys Leu Ser Ser Ile Ala Glu Glu Ile Ile Ser Lys Lys

130 135 140

Gln Glu Glu Ser Arg Pro Lys Glu Asp Ser Asn His Glu Asn His Lys
145 150 155 160

Glu Lys Leu Ser Asn Ile Thr Glu Glu Ser Asp Ser
165 170 
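
The three-letter residues printed beneath each codon line follow from the standard genetic code, so any codon line in this listing can be re-derived as a consistency check. The sketch below is illustrative only and not part of the original listing; the helper name translate, the table names, and the use of "Ter" for stop codons are assumptions, and the standard (universal) genetic code is assumed to apply.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in one-letter form, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_1)
}
# One-letter to three-letter residue names ("Ter" marks a stop codon; assumed label).
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(codon_line):
    """Translate a whitespace-separated line of codons into three-letter residues."""
    return " ".join(THREE_LETTER[CODON_TABLE[c]] for c in codon_line.split())


# First codon line of the coding sequence in SEQ ID NO: 101 (positions 262 to 291).
first_line = translate("ATG AGC AAG AAA AAT AGC GTA ATT TCT GGT")
print(first_line)
```

Applied to the first codon line of SEQ ID NO: 101, this yields Met Ser Lys Lys Asn Ser Val Ile Ser Gly, the first ten residues of SEQ ID NO: 102.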



(2) INFORMATION FOR SEQ ID NO:103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1047 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 34...1005
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:

AGAAAGAAAC CATTCAAGGA ACGCATTGAT TTG ATG AAT AAA CCA TTT TTA ATC 54
Met Asn Lys Pro Phe Leu Ile
1 5

TTA CTC ATA GCC CTA ATT GTC TTT AGC GGC TGT AAC ATG AGA AAA TAT 102
Leu Leu Ile Ala Leu Ile Val Phe Ser Gly Cys Asn Met Arg Lys Tyr
10 15 20

TTC AAA CCC GCT AAA CAC CAA ATT AAA GGC GAA GCG TAT TTC CCT AAC 150
Phe Lys Pro Ala Lys His Gln Ile Lys Gly Glu Ala Tyr Phe Pro Asn
25 30 35

CAT TTG CAA GAA AGT ATC GTT TCG TCT AAT CGT TAT GGA GCC ATT TTG 198
His Leu Gln Glu Ser Ile Val Ser Ser Asn Arg Tyr Gly Ala Ile Leu
40 45 50 55

AAA AAT GGA GCG GTT ATA GGC GAT AAA GGT TTA ACG CAG CTA AGA ATC 246
Lys Asn Gly Ala Val Ile Gly Asp Lys Gly Leu Thr Gln Leu Arg Ile
60 65 70

GGT AAG AAC TTC AAT TAC GAA AGC AGT TTT TTA AAT GAG AGT CAA GGG 294
Gly Lys Asn Phe Asn Tyr Glu Ser Ser Phe Leu Asn Glu Ser Gln Gly
75 80 85

TTT TTT ATT CTT GCG CAA GAT TGT TTG AAC AAG ATT GAT AAA AAA ACA 342
Phe Phe Ile Leu Ala Gln Asp Cys Leu Asn Lys Ile Asp Lys Lys Thr
90 95 100

AAC AAA AGC AAG GTG GCT AAG ACT GAA GAA ACG GAA TTG AAA TTA AAG 390
Asn Lys Ser Lys Val Ala Lys Thr Glu Glu Thr Glu Leu Lys Leu Lys
105 110 115

GGC GTT GAA GCG GAA GTC CAA GAT AAA GTC TGT CAT CAA GTG GAA TTG 438
Gly Val Glu Ala Glu Val Gln Asp Lys Val Cys His Gln Val Glu Leu
120 125 130 135

ATT AGC AAT AAC CCT AAC GCC AGC CAA CAA TCT ATC GTT ATT CCT TTG 486
Ile Ser Asn Asn Pro Asn Ala Ser Gln Gln Ser Ile Val Ile Pro Leu
140 145 150

GAG ACT TTT GCC TTG AGC GCA AGC GTT AAA GGG AAT CTT TTA GCG GTG 534
Glu Thr Phe Ala Leu Ser Ala Ser Val Lys Gly Asn Leu Leu Ala Val
155 160 165

GTG TTA GCG GAC AAT TCA GCG AAC TTA TAC GAC ATC ACT TCT CAA AAA 582
Val Leu Ala Asp Asn Ser Ala Asn Leu Tyr Asp Ile Thr Ser Gln Lys
170 175 180

TTG CTT TTT AGT GAG AAA GGT TCC CCA AGC ACC ACG ATC AAT TCT TTA 630
Leu Leu Phe Ser Glu Lys Gly Ser Pro Ser Thr Thr Ile Asn Ser Leu
185 190 195

ATG GCG ATG CCT ATT TTT ATG GAT ACG GTC GTG GTG TTC CCC ATG CTA 678
Met Ala Met Pro Ile Phe Met Asp Thr Val Val Val Phe Pro Met Leu
200 205 210 215

GAT GGG CGC TTG TTG GTC GTG GAT TAT GTG CAC GGA AAC CCT ACG CCT 726
Asp Gly Arg Leu Leu Val Val Asp Tyr Val His Gly Asn Pro Thr Pro
220 225 230

ATT AGA AAC ATT GTT ATC AGC AGC GAT AAG TTT TTT AAC AAT ATC ACC 774
Ile Arg Asn Ile Val Ile Ser Ser Asp Lys Phe Phe Asn Asn Ile Thr
235 240 245

TAC CTT ATC GTA GAT GGC AAT AAC ATG ATC GCT TCT ACA GGG AAA AGG 822
Tyr Leu Ile Val Asp Gly Asn Asn Met Ile Ala Ser Thr Gly Lys Arg
250 255 260

ATA CTC TCA GTA GTG AGC GGT CAA GAG TTC AAC TAT GAT GGG GAT ATT 870
Ile Leu Ser Val Val Ser Gly Gln Glu Phe Asn Tyr Asp Gly Asp Ile
265 270 275

GTG GAT TTG CTT TAT GAT AAG GGG ACT TTA TAT GTG CTC ACG CTA GAC 918
Val Asp Leu Leu Tyr Asp Lys Gly Thr Leu Tyr Val Leu Thr Leu Asp
280 285 290 295

GGG CAG ATT TTG CAA ATG GAT AAG AGT TTG AGG GAA TTA AAC AGC GTG 966
Gly Gln Ile Leu Gln Met Asp Lys Ser Leu Arg Glu Leu Asn Ser Val
300 305 310

AAA CTG CCT NTC NTC GCT CAA CAC GAT TGT ATT AAA CCA TAATAAATTG TA 1017
Lys Leu Pro Xaa Xaa Ala Gln His Asp Cys Ile Lys Pro
315 320

TTCTTTAGAA AAACGAGGGT ATGTGATAGA 1047

(2) INFORMATION FOR SEQ ID NO:104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Met Asn Lys Pro Phe Leu Ile Leu Leu Ile Ala Leu Ile Val Phe Ser

1 5 10 15

Gly Cys Asn Met Arg Lys Tyr Phe Lys Pro Ala Lys His Gln Ile Lys

20 25 30

Gly Glu Ala Tyr Phe Pro Asn His Leu Gln Glu Ser Ile Val Ser Ser

35 40 45

Asn Arg Tyr Gly Ala Ile Leu Lys Asn Gly Ala Val Ile Gly Asp Lys

50 55 60

Gly Leu Thr Gln Leu Arg Ile Gly Lys Asn Phe Asn Tyr Glu Ser Ser
65 70 75 80

Phe Leu Asn Glu Ser Gln Gly Phe Phe Ile Leu Ala Gln Asp Cys Leu

85 90 95

Asn Lys Ile Asp Lys Lys Thr Asn Lys Ser Lys Val Ala Lys Thr Glu

100 105 110

Glu Thr Glu Leu Lys Leu Lys Gly Val Glu Ala Glu Val Gln Asp Lys

115 120 125

Val Cys His Gln Val Glu Leu Ile Ser Asn Asn Pro Asn Ala Ser Gln

130 135 140

Gln Ser Ile Val Ile Pro Leu Glu Thr Phe Ala Leu Ser Ala Ser Val
145 150 155 160

Lys Gly Asn Leu Leu Ala Val Val Leu Ala Asp Asn Ser Ala Asn Leu

165 170 175

Tyr Asp Ile Thr Ser Gln Lys Leu Leu Phe Ser Glu Lys Gly Ser Pro

180 185 190

Ser Thr Thr Ile Asn Ser Leu Met Ala Met Pro Ile Phe Met Asp Thr

195 200 205

Val Val Val Phe Pro Met Leu Asp Gly Arg Leu Leu Val Val Asp Tyr

210 215 220

Val His Gly Asn Pro Thr Pro Ile Arg Asn Ile Val Ile Ser Ser Asp
225 230 235 240

Lys Phe Phe Asn Asn Ile Thr Tyr Leu Ile Val Asp Gly Asn Asn Met

245 250 255

Ile Ala Ser Thr Gly Lys Arg Ile Leu Ser Val Val Ser Gly Gln Glu




260 265 270 

Phe Asn Tyr Asp Gly Asp Ile Val Asp Leu Leu Tyr Asp Lys Gly Thr

275 280 285

Leu Tyr Val Leu Thr Leu Asp Gly Gln Ile Leu Gln Met Asp Lys Ser

290 295 300

Leu Arg Glu Leu Asn Ser Val Lys Leu Pro Xaa Xaa Ala Gln His Asp
305 310 315 320

Cys Ile Lys Pro



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1968 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 153...1793
(D) OTHER INFORMATION:

(A) NAME/KEY: Signal Sequence
(B) LOCATION: 153...219
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

TCTGGGGGCA TTGCTTACCC TACTACTCGC TTGAAACGCC CAAGCCTGAT CCAATCTCAT 60

AAAGATTCTA ATCGCAATTT TAAAACCATC ACTTTTTGGC TCGTTCCCAC AAAAAGCCAC 120

GCAACTTACT ACATCATTAA GGTTTAATCA CA ATG GAT AAA AAC AAC AAT AAT 173
Met Asp Lys Asn Asn Asn Asn
-20

CTC CGC TTG ATT TTA GCG ATC GCT CTG TCT TTC TTG TTT ATC GCT CTT 221
Leu Arg Leu Ile Leu Ala Ile Ala Leu Ser Phe Leu Phe Ile Ala Leu
-15 -10 -5 1

TAT AGC TAT TTT TTC CAA AAA CCA AAC AAA ACA ACA ACC CAA ACC ACA 269
Tyr Ser Tyr Phe Phe Gln Lys Pro Asn Lys Thr Thr Thr Gln Thr Thr
5 10 15

AAG CAA GAA ACA ACC AAC AAC CAT ACA GCA ACA AGT CCT AAC GCG CCC 317
Lys Gln Glu Thr Thr Asn Asn His Thr Ala Thr Ser Pro Asn Ala Pro
20 25 30

AAC GCC CAA CAT TTT AGC ACC ACT CAA ACA ACC CCC CAA GAG AAT TTG 365
Asn Ala Gln His Phe Ser Thr Thr Gln Thr Thr Pro Gln Glu Asn Leu
35 40 45

CTA AGC ACG ATT TCl TTT GAG CAT GCC AGG ATT GAA ATT GAT TCT TTA 413
Leu Ser Thr Ile Ser Phe Glu His Ala Arg Ile Glu Ile Asp Ser Leu
50 55 60 65

GGG CGC ATC AAA CAG GTT TAT CTC AAG GAT AAA AAG TAT CTA ACC CCT 461
Gly Arg Ile Lys Gln Val Tyr Leu Lys Asp Lys Lys Tyr Leu Thr Pro
70 75 80

AAA CAA AAG GGC TTT TTA GAG CAT GTG GGC CAT CTT TTT AGC TCC AAA 509
Lys Gln Lys Gly Phe Leu Glu His Val Gly His Leu Phe Ser Ser Lys
85 90 95

GAA AAC GCG CAA CCC CCC CTA AAA GAG CTC CCC CTT TTA GCA GCC GAT 557
Glu Asn Ala Gln Pro Pro Leu Lys Glu Leu Pro Leu Leu Ala Ala Asp
100 105 110

AAA CTC AAG CCT TTA GAA GTG CGT TTT TTA GAC CCT ACG CTC AAT AAC 605
Lys Leu Lys Pro Leu Glu Val Arg Phe Leu Asp Pro Thr Leu Asn Asn
115 120 125

AAA GCG TTC AAC ACC CCT TAT AGC GCT TCA AAA ACC ACT CTT GGG CCT 653
Lys Ala Phe Asn Thr Pro Tyr Ser Ala Ser Lys Thr Thr Leu Gly Pro
130 135 140 145

AAC GAA CAG CTT GTT TTA ACC CAA GAT TTA GGC ACT CTT AGC ATC ATT 701
Asn Glu Gln Leu Val Leu Thr Gln Asp Leu Gly Thr Leu Ser Ile Ile
150 155 160

AAA ACC CTG ACT TTC TAT GAT GAT TTG CAT TAT GAT TTA AAA ATC GCA 749
Lys Thr Leu Thr Phe Tyr Asp Asp Leu His Tyr Asp Leu Lys Ile Ala
165 170 175

TTC AAA TCG CCC AAT AAC CTT ATC CCT AGC TAT GTG ATC ACC AAT GGT 797
Phe Lys Ser Pro Asn Asn Leu Ile Pro Ser Tyr Val Ile Thr Asn Gly
180 185 190

TAC AGG CCG GTG GCT GAT TTG GAC AGC TAC ACC TTT TCA GGC GTG CTT 845
Tyr Arg Pro Val Ala Asp Leu Asp Ser Tyr Thr Phe Ser Gly Val Leu
195 200 205

TTA GAA AAT AGC GAC AAA AAA ATT GAA AAA ATT GAA GAT AAA GAC GCT 893
Leu Glu Asn Ser Asp Lys Lys Ile Glu Lys Ile Glu Asp Lys Asp Ala
210 215 220 225

AAA GAA ATC AAA CGC TTT TCT AAC ACC CTC TTT TTA TCC AGC GTG GAT 941
Lys Glu Ile Lys Arg Phe Ser Asn Thr Leu Phe Leu Ser Ser Val Asp
230 235 240

AGG TAT TTC ACC ACC TTG CTT TTC ACT AAA GAT CCT CAA GGT TTT GAA 989
Arg Tyr Phe Thr Thr Leu Leu Phe Thr Lys Asp Pro Gln Gly Phe Glu
245 250 255

GCC TTA ATT GAT TCA GAA ATC GGC ACT AAA AAC CCC TTA GGG TTC ATT 1037
Ala Leu Ile Asp Ser Glu Ile Gly Thr Lys Asn Pro Leu Gly Phe Ile
260 265 270

TCC CTT AAA AAT GAA GCG AAT TTG CAT GGC TAT ATT GGC CCT AAG GAT 1085
Ser Leu Lys Asn Glu Ala Asn Leu His Gly Tyr Ile Gly Pro Lys Asp
275 280 285

TAC CGC TCT TTG AAA GCG ATT TCA CCC ATG CTC ACC GAT GTG ATA GAG 1133
Tyr Arg Ser Leu Lys Ala Ile Ser Pro Met Leu Thr Asp Val Ile Glu
290 295 300 305

TAT GGC TTA ATC ACT TTC TTT GCA AAA GGC GTG TTT GTT TTA CTG GAT 1181
Tyr Gly Leu Ile Thr Phe Phe Ala Lys Gly Val Phe Val Leu Leu Asp
310 315 320

TAT TTG TAT CAA TTC GTG GGC AAT TGG GGT TGG GCT ATC ATT CTT TTA 1229
Tyr Leu Tyr Gln Phe Val Gly Asn Trp Gly Trp Ala Ile Ile Leu Leu
325 330 335

ACG ATT ATC GTG CGC ATC ATC CTT TAT CCT TTA AGC TAT AAG GGC ATG 1277
Thr Ile Ile Val Arg Ile Ile Leu Tyr Pro Leu Ser Tyr Lys Gly Met
340 345 350

GTG AGC ATG CAA AAG CTC AAA GAA TTA GCC CCT AAA ATG AAA GAA CTC 1325
Val Ser Met Gln Lys Leu Lys Glu Leu Ala Pro Lys Met Lys Glu Leu
355 360 365

CAA GAA AAA TAC AAG GGC GAA CCC CAA AAA TTG CAA GCC CAC ATG ATG 1373
Gln Glu Lys Tyr Lys Gly Glu Pro Gln Lys Leu Gln Ala His Met Met
370 375 380 385

CAG CTT TAC AAA AAA CAT GGG GCT AAC CCA CTA GGG GGT TGT CTG CCC 1421
Gln Leu Tyr Lys Lys His Gly Ala Asn Pro Leu Gly Gly Cys Leu Pro
390 395 400

TTA ATC TTA CAA ATC CCG GTG TTT TTT GCC ATT TAT AGA GTG CTT TAT 1469
Leu Ile Leu Gln Ile Pro Val Phe Phe Ala Ile Tyr Arg Val Leu Tyr
405 410 415

AAC GCT GTG GAA TTG AAA AGC TCA GAG TGG ATC TTA TGG ATT CAT GAT 1517
Asn Ala Val Glu Leu Lys Ser Ser Glu Trp Ile Leu Trp Ile His Asp
420 425 430

TTA TCC ATC ATG GAT CCG TAT TTT ATT TTA CCG CTT CTT ATG GGA GCG 1565
Leu Ser Ile Met Asp Pro Tyr Phe Ile Leu Pro Leu Leu Met Gly Ala
435 440 445

TCT ATG TAT TGG CAC CAA AGC GTT ACG CCA AAC ACC ATG ACC GAT CCC 1613
Ser Met Tyr Trp His Gln Ser Val Thr Pro Asn Thr Met Thr Asp Pro
450 455 460 465

ATG CAA GCA AAG ATT TTT AAA CTC TTA CCC CTA TTA TTC ACA ATC TTT 1661
Met Gln Ala Lys Ile Phe Lys Leu Leu Pro Leu Leu Phe Thr Ile Phe
470 475 480

TTA ATC ACT TTC CCG GCA GGG TTA GTC TTG TAT TGG ACC ACG AAC AAC 1709
Leu Ile Thr Phe Pro Ala Gly Leu Val Leu Tyr Trp Thr Thr Asn Asn
485 490 495

ATC CTT TCG GTG TTU CAA CAA CTC ATC ATC AAT AAA GTC TTA GAG AAT 1757
Ile Leu Ser Val Leu Gln Gln Leu Ile Ile Asn Lys Val Leu Glu Asn
500 505 510

AAA AAA CGC ATG CAT GCG CAA AAC AAA AAG GAA CAT TGATGCAAAA TTTTAT 1809
Lys Lys Arg Met His Ala Gln Asn Lys Lys Glu His
515 520 525

TGAAATCAAA GCCAAAACCT TAGAAGAAGC CCTCATTCAA GCTTCTATCG CCTTGAATTG 1869

CCCCATTATT AATTTGCAAT ACGAAGTCAT TCAAACGCCC TCTAAAGGGT TTTTAAGCAT 1929

TGGTAAAAAA GAAGCCATTA TCTTAGCGGG CGTTAAAGA 1968



(2) INFORMATION FOR SEQ ID NO: 106:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 547 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal
(ix) FEATURE:

(A) NAME/KEY: Signal Sequence
(B) LOCATION: 1...22
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

Met Asp Lys Asn Asn Asn Asn Leu Arg Leu Ile Leu Ala Ile Ala Leu

-20 -15 -10

Ser Phe Leu Phe Ile Ala Leu Tyr Ser Tyr Phe Phe Gln Lys Pro Asn

-5 1 5 10

Lys Thr Thr Thr Gln Thr Thr Lys Gln Glu Thr Thr Asn Asn His Thr

15 20 25

Ala Thr Ser Pro Asn Ala Pro Asn Ala Gln His Phe Ser Thr Thr Gln

30 35 40

Thr Thr Pro Gln Glu Asn Leu Leu Ser Thr Ile Ser Phe Glu His Ala

45 50 55

Arg Ile Glu Ile Asp Ser Leu Gly Arg Ile Lys Gln Val Tyr Leu Lys

60 65 70

Asp Lys Lys Tyr Leu Thr Pro Lys Gln Lys Gly Phe Leu Glu His Val
75 80 85 90

Gly His Leu Phe Ser Ser Lys Glu Asn Ala Gln Pro Pro Leu Lys Glu

95 100 105

Leu Pro Leu Leu Ala Ala Asp Lys Leu Lys Pro Leu Glu Val Arg Phe

110 115 120

Leu Asp Pro Thr Leu Asn Asn Lys Ala Phe Asn Thr Pro Tyr Ser Ala

125 130 135

Ser Lys Thr Thr Leu Gly Pro Asn Glu Gln Leu Val Leu Thr Gln Asp

140 145 150 

Leu Gly Thr Leu Ser Ile Ile Lys Thr Leu Thr Phe Tyr Asp Asp Leu
155 160 165 170

His Tyr Asp Leu Lys Ile Ala Phe Lys Ser Pro Asn Asn Leu Ile Pro
175 180 185

Ser Tyr Val Ile Thr Asn Gly Tyr Arg Pro Val Ala Asp Leu Asp Ser
190 195 200

Tyr Thr Phe Ser Gly Val Leu Leu Glu Asn Ser Asp Lys Lys Ile Glu
205 210 215

Lys Ile Glu Asp Lys Asp Ala Lys Glu Ile Lys Arg Phe Ser Asn Thr
220 225 230

Leu Phe Leu Ser Ser Val Asp Arg Tyr Phe Thr Thr Leu Leu Phe Thr
235 240 245 250

Lys Asp Pro Gln Gly Phe Glu Ala Leu Ile Asp Ser Glu Ile Gly Thr
255 260 265

Lys Asn Pro Leu Gly Phe Ile Ser Leu Lys Asn Glu Ala Asn Leu His
270 275 280

Gly Tyr Ile Gly Pro Lys Asp Tyr Arg Ser Leu Lys Ala Ile Ser Pro
285 290 295

Met Leu Thr Asp Val Ile Glu Tyr Gly Leu Ile Thr Phe Phe Ala Lys
300 305 310

Gly Val Phe Val Leu Leu Asp Tyr Leu Tyr Gln Phe Val Gly Asn Trp
315 320 325 330

Gly Trp Ala Ile Ile Leu Leu Thr Ile Ile Val Arg Ile Ile Leu Tyr
335 340 345

Pro Leu Ser Tyr Lys Gly Met Val Ser Met Gln Lys Leu Lys Glu Leu
350 355 360

Ala Pro Lys Met Lys Glu Leu Gln Glu Lys Tyr Lys Gly Glu Pro Gln
365 370 375

Lys Leu Gln Ala His Met Met Gln Leu Tyr Lys Lys His Gly Ala Asn
380 385 390

Pro Leu Gly Gly Cys Leu Pro Leu Ile Leu Gln Ile Pro Val Phe Phe
395 400 405 410

Ala Ile Tyr Arg Val Leu Tyr Asn Ala Val Glu Leu Lys Ser Ser Glu
415 420 425

Trp Ile Leu Trp Ile His Asp Leu Ser Ile Met Asp Pro Tyr Phe Ile
430 435 440

Leu Pro Leu Leu Met Gly Ala Ser Met Tyr Trp His Gln Ser Val Thr
445 450 455

Pro Asn Thr Met Thr Asp Pro Met Gln Ala Lys Ile Phe Lys Leu Leu
460 465 470

Pro Leu Leu Phe Thr Ile Phe Leu Ile Thr Phe Pro Ala Gly Leu Val
475 480 485 490

Leu Tyr Trp Thr Thr Asn Asn Ile Leu Ser Val Leu Gln Gln Leu Ile
495 500 505

Ile Asn Lys Val Leu Glu Asn Lys Lys Arg Met His Ala Gln Asn Lys
510 515 520

Lys Glu His
525



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3280 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 151...3207
(D) OTHER INFORMATION:

(A) NAME/KEY: Signal Sequence
(B) LOCATION: 151...241
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

TAAAGGTTTT AGGCCTGTGG TGGTTCAAGT TTTAGAAGAG CGCAGCAAGA TTTTTATCGT 60

GAACGCTCAA AATTTACACC CTAATGACAG CGTGGCAGTG GGGTCATTGA TAGGGTTAAA 120

AGGCATGATC AACAATTTAG GGGAGGAATG ATG CTC GCT TCC ATT ATT GAA TTT 174
Met Leu Ala Ser Ile Ile Glu Phe
-30 -25

TCC TTA CGC CAA AGA GTG ATC GTG ATT GTT GGT GCG ATT CTT ATT TTA 222
Ser Leu Arg Gln Arg Val Ile Val Ile Val Gly Ala Ile Leu Ile Leu
-20 -15 -10

TTT TTT GGG ACT TAT AGT TTT ATC AAC ACT CCA GTG GAC GCT TTC CCG 270
Phe Phe Gly Thr Tyr Ser Phe Ile Asn Thr Pro Val Asp Ala Phe Pro
-5 1 5 10

GAT ATT TCG CCC ACT CAA GTT AAA ATC ATT TTA AAA CTC CCC GGC TCT 318
Asp Ile Ser Pro Thr Gln Val Lys Ile Ile Leu Lys Leu Pro Gly Ser
15 20 25

AGC CCT GAA GAA ATG GAA AAC AAC ATC GTG CGC CCT TTA GAA TTG GAG 366
Ser Pro Glu Glu Met Glu Asn Asn Ile Val Arg Pro Leu Glu Leu Glu
30 35 40

CTT TTA GGC TTG AAA GGG CAA AAA TCT TTA AGG AGT GTT TCA AAA TAT 414
Leu Leu Gly Leu Lys Gly Gln Lys Ser Leu Arg Ser Val Ser Lys Tyr
45 50 55

TCT ATT TCA GAT ATT ACG ATA GAT TTT GAT GAC AGC GTG GAT ATT TAT 462
Ser Ile Ser Asp Ile Thr Ile Asp Phe Asp Asp Ser Val Asp Ile Tyr
60 65 70

TTA GCG AGG AAT ATT GTC AAT GAG CGC TTG AGC AGC GTG ATG AAA GAT 510
Leu Ala Arg Asn Ile Val Asn Glu Arg Leu Ser Ser Val Met Lys Asp
75 80 85 90

TTA CCC GTG GGG GTT GAG GGG GGC ATG GCG CCC ATT GTT ACG CCG CTA 558
Leu Pro Val Gly Val Glu Gly Gly Met Ala Pro Ile Val Thr Pro Leu
95 100 105

TCA GAT ATC TTT ATG TTC ACT ATT GAT GGC AAT ATC ACT GAG ATA GAA 606
Ser Asp Ile Phe Met Phe Thr Ile Asp Gly Asn Ile Thr Glu Ile Glu
110 115 120

AAA CGA CAG CTT TTA GAT TTT GTG ATC CGC CCA CAA TTA AGA ATG ATT 654
Lys Arg Gln Leu Leu Asp Phe Val Ile Arg Pro Gln Leu Arg Met Ile
125 130 135

AGC GGC GTA GCA GAT GTC AAT TCC ATT GGA GGC TTT AGC AGA GCG TTT 702
Ser Gly Val Ala Asp Val Asn Ser Ile Gly Gly Phe Ser Arg Ala Phe
140 145 150

GTG ATC GTG CCG GAT TTT AAT GAC ATG GCA AGG CTT GGG GTG AGT ATT 750
Val Ile Val Pro Asp Phe Asn Asp Met Ala Arg Leu Gly Val Ser Ile
155 160 165 170

TCT GAT TTA GAA TCG GCT GTG AGA GTG AAT TTA AGA AAC AGC GGA GCG 798
Ser Asp Leu Glu Ser Ala Val Arg Val Asn Leu Arg Asn Ser Gly Ala
175 180 185

GGG CGC GTG GAT AGA GAT GGC GAA ACC TTT TTA GTC AAA ATC CAA ACC 846
Gly Arg Val Asp Arg Asp Gly Glu Thr Phe Leu Val Lys Ile Gln Thr
190 195 200

GCT TCT TTG AGT TTA GAA GAC ATT GGC AAA ATC ACC GTT TCC ACT AAT 894
Ala Ser Leu Ser Leu Glu Asp Ile Gly Lys Ile Thr Val Ser Thr Asn
205 210 215

TTA GGG CAT TTG CAC ATT AAG GAT TTT GCG AAA GTC ATC AGC CAG TCT 942
Leu Gly His Leu His Ile Lys Asp Phe Ala Lys Val Ile Ser Gln Ser
220 225 230

CGC ACC CGT TTG GGG TTT GTT ACT AAA GAT GGC GTG GGC GAG ACC ACA 990
Arg Thr Arg Leu Gly Phe Val Thr Lys Asp Gly Val Gly Glu Thr Thr
235 240 245 250

GAA GGC TTG GTG CTT TCT TTA AAA GAC GCT AAC ACC AAA GAA ATC ATC 1038
Glu Gly Leu Val Leu Ser Leu Lys Asp Ala Asn Thr Lys Glu Ile Ile
255 260 265

ACT CAA GTG TAT CAA AAA CTA GAA GAA TTA AAA CCC TTT TTA CCG AAT 1086
Thr Gln Val Tyr Gln Lys Leu Glu Glu Leu Lys Pro Phe Leu Pro Asn
270 275 280

GGC GTG TCC ATT AAT GTT TTT TAT GAT CGC TCA GAA TTT ACG CAA AAA 1134
Gly Val Ser Ile Asn Val Phe Tyr Asp Arg Ser Glu Phe Thr Gln Lys
285 290 295

GCC ATT GCC ACC GTT TCT AAA ACC CTC ATT GAA GCC GTT GTT TTA ATC 1182
Ala Ile Ala Thr Val Ser Lys Thr Leu Ile Glu Ala Val Val Leu Ile
300 305 310

ATC ATC ACG CTC TTT TTA TTT TTA GGG AAT TTG AGG GCG AGC GTG GCT 1230
Ile Ile Thr Leu Phe Leu Phe Leu Gly Asn Leu Arg Ala Ser Val Ala
315 320 325 330

GTG GGG GTG ATT TTA CCT TTA AGC TTG TCC GTG GCG TTT ATT TTT ATC 1278
Val Gly Val Ile Leu Pro Leu Ser Leu Ser Val Ala Phe Ile Phe Ile
335 340 345

AAG TTT AGC GAT CTG ACT TTA AAT TTG ATG AGT TTA GGG GGA TTG GTT 1326
Lys Phe Ser Asp Leu Thr Leu Asn Leu Met Ser Leu Gly Gly Leu Val
350 355 360

ATC GCT ATA GGC ATG CTC ATT GAC TCA GCC GTG GTG GTG GTG GAA AAC 1374
Ile Ala Ile Gly Met Leu Ile Asp Ser Ala Val Val Val Val Glu Asn
365 370 375

GCT TTT GAA AAA TTA AGC GCT AAC ACT AAA ACC ACT AAA CTC CAT GCA 1422
Ala Phe Glu Lys Leu Ser Ala Asn Thr Lys Thr Thr Lys Leu His Ala
380 385 390

ATC TAT CGT TCG TGT AAA GAA ATC GCT GTT TCA GTG GTG AGC GGG GTG 1470
Ile Tyr Arg Ser Cys Lys Glu Ile Ala Val Ser Val Val Ser Gly Val
395 400 405 410

GTG ATC ATC ATT GTG TTT TTT GTG CCG ATT TTA ACC TTA CAG GGG TTA 1518
Val Ile Ile Ile Val Phe Phe Val Pro Ile Leu Thr Leu Gln Gly Leu
415 420 425

GAG GGT AAG ATG TTT AGG CCT TTA GCG CAA AGC ATT GTG TAT GCG CTT 1566
Glu Gly Lys Met Phe Arg Pro Leu Ala Gln Ser Ile Val Tyr Ala Leu
430 435 440

TTA GGC ACT TTA GTT CTA TCT ATT ACA ATC ATT CCT GTA GTC AGC TCT 1614
Leu Gly Thr Leu Val Leu Ser Ile Thr Ile Ile Pro Val Val Ser Ser
445 450 455

CTT GTC TTA AAA GCC ACG CCC CAT AGC GAA ACC TTT TTA ACG AGG TTT 1662
Leu Val Leu Lys Ala Thr Pro His Ser Glu Thr Phe Leu Thr Arg Phe
460 465 470

TTA AAC AGA ATC TAC GCC CCT TTA TTG GAA TTT TTT GTG CAT AAC CCT 1710
Leu Asn Arg Ile Tyr Ala Pro Leu Leu Glu Phe Phe Val His Asn Pro
475 480 485 490

AAA AAA GTG ATT TTA GGA GCG TTT GTT TTT TTA ATC GCA AGC CTT TCT 1758
Lys Lys Val Ile Leu Gly Ala Phe Val Phe Leu Ile Ala Ser Leu Ser
495 500 505

TTA TTC CCT TTT GTG GGG AAG AAT TTC ATG CCC GTT TTA GAT GAG GGC 1806
Leu Phe Pro Phe Val Gly Lys Asn Phe Met Pro Val Leu Asp Glu Gly
510 515 520

GAT GTG GTT TTG AGC GTG GAA ACC ACC CCT TCT ATT TCT TTA GAT CAA 1854
Asp Val Val Leu Ser Val Glu Thr Thr Pro Ser Ile Ser Leu Asp Gln
525 530 535

TCT AGG GAT CTC ATG CTA AAC ATT GAG AGC GCG ATT AAA AAG CAT GTC 1902
Ser Arg Asp Leu Met Leu Asn Ile Glu Ser Ala Ile Lys Lys His Val
540 545 550

AAG GAA GTT AAA AGC ATT GTC GCG CGC ACA GGG AGC GAT GAA TTG GGG 1950
Lys Glu Val Lys Ser Ile Val Ala Arg Thr Gly Ser Asp Glu Leu Gly
555 560 565 570

CTG GAT TTA GGA GGT TTG AAT CAA ACC GAT ACT TTT ATT TCT TTT ATT 1998
Leu Asp Leu Gly Gly Leu Asn Gln Thr Asp Thr Phe Ile Ser Phe Ile
575 580 585

CCT AAA AAA GAA TGG AGC GTT AAA ACC AAA GAT GAA TTA TTA GAA AAA 2046
Pro Lys Lys Glu Trp Ser Val Lys Thr Lys Asp Glu Leu Leu Glu Lys
590 595 600

ATC ATG GAT TCT TTA AAA GAC TTT AAG GGG ATT AAC TTT TCT TTC ACC 2094
Ile Met Asp Ser Leu Lys Asp Phe Lys Gly Ile Asn Phe Ser Phe Thr
605 610 615

CAA CCC ATT GAA ATG AGA ATT TCT GAA ATG CTG ACA GGG GTT AGG GGG 2142
Gln Pro Ile Glu Met Arg Ile Ser Glu Met Leu Thr Gly Val Arg Gly
620 625 630

GAT TTA GCG GTT AAG ATT TTT GGA GAT GGT ATT AGC GAA TTG AAT GAA 2190
Asp Leu Ala Val Lys Ile Phe Gly Asp Gly Ile Ser Glu Leu Asn Glu
635 640 645 650

TTG AGT TTT CAA ATC GCG CAA GCT CTA AAA GGG ATT AAA GGA TCT AGT 2238
Leu Ser Phe Gln Ile Ala Gln Ala Leu Lys Gly Ile Lys Gly Ser Ser
655 660 665

GAA GTT TTA ACC ACG CTT AAT GAG GGC GTG AAT TAT TTG TAT GTA ACC 2286
Glu Val Leu Thr Thr Leu Asn Glu Gly Val Asn Tyr Leu Tyr Val Thr
670 675 680

CCT AAT AAA GAA TCG ATG GCG GAT GTG GGG ATC ACT AGC GAT GAA TTT 2334
Pro Asn Lys Glu Ser Met Ala Asp Val Gly Ile Thr Ser Asp Glu Phe
685 690 695

TCC AAG TTT TTA AAA TCC GCT TTA GAG GGC TTG GTT GTA GAT GTG ATC 2382
Ser Lys Phe Leu Lys Ser Ala Leu Glu Gly Leu Val Val Asp Val Ile
700 705 710

CCT ACA GGG ATT TCA CGC ACG CCA GTG ATG ATC CGC CAA GAG AGC GAT 2430
Pro Thr Gly Ile Ser Arg Thr Pro Val Met Ile Arg Gln Glu Ser Asp
715 720 725 730

TTT GCA AGC TCT ATC ACT AAA ATC AAA AGT TTA GCC TTG ACT TCA AAA 2478
Phe Ala Ser Ser Ile Thr Lys Ile Lys Ser Leu Ala Leu Thr Ser Lys
735 740 745

TAT GGC GTT TTA GTG CCT ATC ACT TCT ATC GCC AAA ATT GAA GAA GTG 2526
Tyr Gly Val Leu Val Pro Ile Thr Ser Ile Ala Lys Ile Glu Glu Val
750 755 760

GAT GGC CCT GTT TCT GTT GTG CGT GAA AAT TCA ATG CGC ATG AGC GTG 2574
Asp Gly Pro Val Ser Val Val Arg Glu Asn Ser Met Arg Met Ser Val
765 770 775 
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GTT CGC AGT AAT GTG GTG -GGG CGC GAT TTG AAA TCT TTT GTA GAA GAG 2622 
Val Arg Ser Asn Val Val Gly Arg Asp Leu Lys Ser Phe Val Glu Glu 
780 785 790 

GCT AAA AAA GTG ATC GCT CAA AAC ATC AAA CTC CCT CCC AGC TAC TAT 2 67 0 

Ala Lys Lys Val He Ala Gin Asn He Lys Leu Pro Pro Ser Tyr Tyr 
795 800 805 810 

ATC ACT TAT GGG GGG CAG TTT GAA AAC CAG CAA CGG GCC AAT AAA AGG 2718 
He Thr Tyr Gly Gly Gin Phe Glu Asn Gin Gin Arg Ala Asn Lys Arg 
815 820 825 

CTC TCC ACC GTT ATC CCT TTA AGC ATC TTA GCG ATT TTT TTC ATT CTT 2 76 6 

Leu Ser Thr Val He Pro Leu Ser He Leu Ala He Phe Phe He Leu 
830 835 840 

TTT TTC ACT TTT AAA AGC ATT CCT TTA GCC TTG CTC ATT CTT TTG AAT 2 814 

Phe Phe Thr Phe Lys Ser He Pro Leu Ala Leu Leu He Leu Leu Asn 
845 850 855 

ATC CCT TTT GCG GTT ACC GGA GGC CTT ATT GCG TTG TTT GCG GTC GGG 2 8 62 

Ile Pro Phe Ala Val Thr Gly Gly Leu Ile Ala Leu Phe Ala Val Gly
860 865 870

GAG TAT ATT TCA GTG CCA GCG AGC GTG GGC TTT ATC GCT CTT TTT GGG 2 910 

Glu Tyr He Ser Val Pro Ala Ser Val Gly Phe He Ala Leu Phe Gly 
875 880 885 890 

ATT GCG GTT TTA AAT GGC GTG GTG ATG ATA GGC TAT TTT AAA GAG CTT 2 95 8 

He Ala Val Leu Asn Gly Val Val Met lie Gly Tyr Phe Lys Glu Leu 
895 900 905 

CTC TTG CAA GGG AAA AGC GTA GAA GAA TGC GTT TTA TTG GGC GCT AAA 3 0 06 

Leu Leu Gin Gly Lys Ser Val Glu Glu Cys Val Leu Leu Gly Ala Lys 
910 915 920 

AGG CGT TTG AGA CCG GTT TTA ATG ACC GCT TGC ATT GCC GGT TTG GGT 3 0 54 

Arg Arg Leu Arg Pro Val Leu Met Thr Ala Cys He Ala Gly Leu Gly 
925 930 935 

TTG CTC CCT TTA TTA TTT TCT CAT AGC GTG GGA TCA GAA GTC CAA AAA 3102 
Leu Leu Pro Leu Leu Phe Ser His Ser Val Gly Ser Glu Val Gin Lys 
940 945 950 

CCT TTA GCG ATC GTG GTG CTT GGA GGC TTG GTT ACC TCA AGC GCT CTA 315 0 

Pro Leu Ala He Val Val Leu Gly Gly Leu Val Thr Ser Ser Ala Leu 
955 960 965 970 

ACC TTA CTC CTA CTG CCG CCA ATG TTT ATG CTC ATC GCT AAA AAG ATT 3198 
Thr Leu Leu Leu Leu Pro Pro Met Phe Met Leu He Ala Lys Lys He 
975 980 985 

AAA ATC GTT TGAGTTAAAG GATTTCACAT GCTCGCTTTA GAAATTTATA TTGATATTT 3256
Lys Ile Val




GTTTGAAAGA CGCTTTAATA GATT 3280

(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1019 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal
(ix) FEATURE:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 1...30
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

Met Leu Ala Ser Ile Ile Glu Phe Ser Leu Arg Gln Arg Val Ile Val
-30 -25 -20 -15

Ile Val Gly Ala Ile Leu Ile Leu Phe Phe Gly Thr Tyr Ser Phe Ile
-10 -5 1

Asn Thr Pro Val Asp Ala Phe Pro Asp Ile Ser Pro Thr Gln Val Lys
5 10 15

Ile Ile Leu Lys Leu Pro Gly Ser Ser Pro Glu Glu Met Glu Asn Asn
20 25 30

Ile Val Arg Pro Leu Glu Leu Glu Leu Leu Gly Leu Lys Gly Gln Lys
35 40 45 50

Ser Leu Arg Ser Val Ser Lys Tyr Ser Ile Ser Asp Ile Thr Ile Asp
55 60 65

Phe Asp Asp Ser Val Asp Ile Tyr Leu Ala Arg Asn Ile Val Asn Glu
70 75 80

Arg Leu Ser Ser Val Met Lys Asp Leu Pro Val Gly Val Glu Gly Gly
85 90 95

Met Ala Pro Ile Val Thr Pro Leu Ser Asp Ile Phe Met Phe Thr Ile
100 105 110

Asp Gly Asn Ile Thr Glu Ile Glu Lys Arg Gln Leu Leu Asp Phe Val
115 120 125 130

Ile Arg Pro Gln Leu Arg Met Ile Ser Gly Val Ala Asp Val Asn Ser
135 140 145

Ile Gly Gly Phe Ser Arg Ala Phe Val Ile Val Pro Asp Phe Asn Asp
150 155 160

Met Ala Arg Leu Gly Val Ser Ile Ser Asp Leu Glu Ser Ala Val Arg
165 170 175

Val Asn Leu Arg Asn Ser Gly Ala Gly Arg Val Asp Arg Asp Gly Glu
180 185 190

Thr Phe Leu Val Lys Ile Gln Thr Ala Ser Leu Ser Leu Glu Asp Ile
195 200 205 210

Gly Lys Ile Thr Val Ser Thr Asn Leu Gly His Leu His Ile Lys Asp
215 220 225

Phe Ala Lys Val Ile Ser Gln Ser Arg Thr Arg Leu Gly Phe Val Thr
230 235 240

Lys Asp Gly Val Gly Glu Thr Thr Glu Gly Leu Val Leu Ser Leu Lys
245 250 255

Asp Ala Asn Thr Lys Glu Ile Ile Thr Gln Val Tyr Gln Lys Leu Glu
260 265 270

Glu Leu Lys Pro Phe Leu Pro Asn Gly Val Ser Ile Asn Val Phe Tyr
275 280 285 290

Asp Arg Ser Glu Phe Thr Gln Lys Ala Ile Ala Thr Val Ser Lys Thr
295 300 305

Leu Ile Glu Ala Val Val Leu Ile Ile Ile Thr Leu Phe Leu Phe Leu
310 315 320

Gly Asn Leu Arg Ala Ser Val Ala Val Gly Val Ile Leu Pro Leu Ser
325 330 335

Leu Ser Val Ala Phe Ile Phe Ile Lys Phe Ser Asp Leu Thr Leu Asn
340 345 350

Leu Met Ser Leu Gly Gly Leu Val Ile Ala Ile Gly Met Leu Ile Asp
355 360 365 370

Ser Ala Val Val Val Val Glu Asn Ala Phe Glu Lys Leu Ser Ala Asn
375 380 385

Thr Lys Thr Thr Lys Leu His Ala Ile Tyr Arg Ser Cys Lys Glu Ile
390 395 400

Ala Val Ser Val Val Ser Gly Val Val Ile Ile Ile Val Phe Phe Val
405 410 415

Pro Ile Leu Thr Leu Gln Gly Leu Glu Gly Lys Met Phe Arg Pro Leu
420 425 430

Ala Gln Ser Ile Val Tyr Ala Leu Leu Gly Thr Leu Val Leu Ser Ile
435 440 445 450

Thr Ile Ile Pro Val Val Ser Ser Leu Val Leu Lys Ala Thr Pro His
455 460 465

Ser Glu Thr Phe Leu Thr Arg Phe Leu Asn Arg Ile Tyr Ala Pro Leu
470 475 480

Leu Glu Phe Phe Val His Asn Pro Lys Lys Val Ile Leu Gly Ala Phe
485 490 495

Val Phe Leu Ile Ala Ser Leu Ser Leu Phe Pro Phe Val Gly Lys Asn
500 505 510

Phe Met Pro Val Leu Asp Glu Gly Asp Val Val Leu Ser Val Glu Thr
515 520 525 530

Thr Pro Ser Ile Ser Leu Asp Gln Ser Arg Asp Leu Met Leu Asn Ile
535 540 545

Glu Ser Ala Ile Lys Lys His Val Lys Glu Val Lys Ser Ile Val Ala
550 555 560

Arg Thr Gly Ser Asp Glu Leu Gly Leu Asp Leu Gly Gly Leu Asn Gln
565 570 575

Thr Asp Thr Phe Ile Ser Phe Ile Pro Lys Lys Glu Trp Ser Val Lys
580 585 590

Thr Lys Asp Glu Leu Leu Glu Lys Ile Met Asp Ser Leu Lys Asp Phe
595 600 605 610

Lys Gly Ile Asn Phe Ser Phe Thr Gln Pro Ile Glu Met Arg Ile Ser
615 620 625

Glu Met Leu Thr Gly Val Arg Gly Asp Leu Ala Val Lys Ile Phe Gly
630 635 640

Asp Gly Ile Ser Glu Leu Asn Glu Leu Ser Phe Gln Ile Ala Gln Ala
645 650 655

Leu Lys Gly Ile Lys Gly Ser Ser Glu Val Leu Thr Thr Leu Asn Glu
660 665 670

Gly Val Asn Tyr Leu Tyr Val Thr Pro Asn Lys Glu Ser Met Ala Asp
675 680 685 690

Val Gly Ile Thr Ser Asp Glu Phe Ser Lys Phe Leu Lys Ser Ala Leu
695 700 705

Glu Gly Leu Val Val Asp Val Ile Pro Thr Gly Ile Ser Arg Thr Pro
710 715 720

Val Met Ile Arg Gln Glu Ser Asp Phe Ala Ser Ser Ile Thr Lys Ile
725 730 735

Lys Ser Leu Ala Leu Thr Ser Lys Tyr Gly Val Leu Val Pro Ile Thr
740 745 750

Ser Ile Ala Lys Ile Glu Glu Val Asp Gly Pro Val Ser Val Val Arg
755 760 765 770

Glu Asn Ser Met Arg Met Ser Val Val Arg Ser Asn Val Val Gly Arg
775 780 785

Asp Leu Lys Ser Phe Val Glu Glu Ala Lys Lys Val Ile Ala Gln Asn
790 795 800

Ile Lys Leu Pro Pro Ser Tyr Tyr Ile Thr Tyr Gly Gly Gln Phe Glu
805 810 815

Asn Gln Gln Arg Ala Asn Lys Arg Leu Ser Thr Val Ile Pro Leu Ser
820 825 830

Ile Leu Ala Ile Phe Phe Ile Leu Phe Phe Thr Phe Lys Ser Ile Pro
835 840 845 850

Leu Ala Leu Leu Ile Leu Leu Asn Ile Pro Phe Ala Val Thr Gly Gly
855 860 865

Leu Ile Ala Leu Phe Ala Val Gly Glu Tyr Ile Ser Val Pro Ala Ser
870 875 880

Val Gly Phe Ile Ala Leu Phe Gly Ile Ala Val Leu Asn Gly Val Val
885 890 895

Met Ile Gly Tyr Phe Lys Glu Leu Leu Leu Gln Gly Lys Ser Val Glu
900 905 910

Glu Cys Val Leu Leu Gly Ala Lys Arg Arg Leu Arg Pro Val Leu Met
915 920 925 930

Thr Ala Cys Ile Ala Gly Leu Gly Leu Leu Pro Leu Leu Phe Ser His
935 940 945

Ser Val Gly Ser Glu Val Gln Lys Pro Leu Ala Ile Val Val Leu Gly
950 955 960

Gly Leu Val Thr Ser Ser Ala Leu Thr Leu Leu Leu Leu Pro Pro Met
965 970 975

Phe Met Leu Ile Ala Lys Lys Ile Lys Ile Val
980 985



(2) INFORMATION FOR SEQ ID NO: 109:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 898 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 86...835
(D) OTHER INFORMATION:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 86...161
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

GCATAAAATA AACAAACATT AAGTAAGGCT TATCAATATT TGATTACAAT TATAAGGGTT 6 0 

ACATTTTTTT AATAGGAGAT ATACC ATG CTA GGA AAC GTT AAA AAA ACC CTT 112 

Met Leu Gly Asn Val Lys Lys Thr Leu 

-25 -20 

TTT GGG GTC TTG TGT TTG GGC ACG TTG TGT TTG AGA GGG TTA ATG GCA 16 0 

Phe Gly Val Leu Cys Leu Gly Thr Leu Cys Leu Arg Gly Leu Met Ala 
-15 -10 -5 

GAG CCA GAC GCT AAA GAG CTT GTT AAT TTA GGC ATA GAG AGC GCG AAG 20 8 

Glu Pro Asp Ala Lys Glu Leu Val Asn Leu Gly lie Glu Ser Ala Lys 
1 5 10 15

AAG CAA GAT TTC GCT CAA GCT AAA ACG CAT TTT GAA AAA GCT TGT GAG 2 56 

Lys Gin Asp Phe Ala Gin Ala Lys Thr His Phe Glu Lys Ala Cys Glu 
20 25 30 

TTA AAA AAT GGC TTT GGA TGT GTT TTT TTA GGG GCG TTC TAT GAA GAA 3 04 

Leu Lys Asn Gly Phe Gly Cys Val Phe Leu Gly Ala Phe Tyr Glu Glu 
35 40 45 

GGG AAA GGA GTG GGA AAA GAC TTG AAA AAA GCC ATC CAA TTT TAC ACT 3 52 

Gly Lys Gly Val Gly Lys Asp Leu Lys Lys Ala He Gin Phe Tyr Thr 
50 55 60 

AAA GGT TGT GAA TTA AAT GAT GGT TAT GGG TGT AAC CTG CTA GGA AAT 4 00 

Lys Gly Cys Glu Leu Asn Asp Gly Tyr Gly Cys Asn Leu Leu Gly Asn 
65 70 75 80 

TTA TAC TAT AAC GGA CAA GGC GTG TCA AAA GAC GCT AAA AAA GCC TCA 44 8 

Leu Tyr Tyr Asn Gly Gin Gly Val Ser Lys Asp Ala Lys Lys Ala Ser 
85 90 95 

CAA TAC TAC TCT AAA GCT TGC GAC TTA AAC CAT GCT GAA GGG TGT ATG 4 96 

Gin Tyr Tyr Ser Lys Ala Cys Asp Leu Asn His Ala Glu Gly Cys Met 
100 105 110

GTA TTA GGA AGC TTA CAC CAT TAT GGC GTA GGC ACG CCT AAG GAT TTA 54 4 

Val Leu Gly Ser Leu His His Tyr Gly Val Gly Thr Pro Lys Asp Leu 
115 120 125 

AGA AAG GCT CTT GAT TTG TAT GAA AAA GCT TGC GAT TTA AAA GAC AGC 5 92 

Arg Lys Ala Leu Asp Leu Tyr Glu Lys Ala Cys Asp Leu Lys Asp Ser 
130 135 140

CCA GGG TGT ATT AAT GCA GGA TAT ATA TAT AGT GTA ACA AAG AAT TTT 640

Pro Gly Cys lie Asn Ala Gly Tyr lie Tyr Ser Val Thr Lys Asn Phe 
145 150 155 160 

AAG GAG GCT ATC GTT CGT TAT TCT AAA GCA TGC GAA TTA AAA GAT GGT 68 8 

Lys Glu Ala lie Val Arg Tyr Ser Lys Ala Cys Glu Leu Lys Asp Gly 
165 170 175 

AGG GGG TGT TAT AAT TTA GGG GTT ATG CAA TAC AAC GCT CAA GGT ACA 73 6 

Arg Gly Cys Tyr Asn Leu Gly Val Met Gin Tyr Asn Ala Gin Gly Thr 
180 185 190 

GCA AAG GAC GAA AAG CAA GCG GTA GAA AAC TTT AAA AAA GGC TGC AAA 7 84 

Ala Lys Asp Glu Lys Gin Ala Val Glu Asn Phe Lys Lys Gly Cys Lys 

195 200 205 

TCA AGC GTT AAA GAA GCA TGC GAC GCT CTC AAG GAA TTA AAA ATA GAA 83 2 

Ser Ser Val Lys Glu Ala Cys Asp Ala Leu Lys Glu Leu Lys lie Glu 
210 215 220 

CTT TAATTTCAAT GAAGTTAGCT AAACGCTGCG TTTAGCTGGC TTTTACGCTT TTTATA 891
Leu
225

TTTTAAG 898
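The feature table for SEQ ID NO: 109 places the coding sequence at positions 86 to 835, which should cover a whole number of codons. The short Python sketch below illustrates that arithmetic. The function name `codon_count` is hypothetical and not part of the listing, and the range is treated as 1-based, inclusive, and excluding the stop codon, which is an assumption inferred from how the position numbers line up.

```python
def codon_count(start: int, end: int) -> int:
    """Return the number of complete codons in the inclusive range start..end.

    Assumes 1-based, inclusive coordinates that exclude the stop codon.
    """
    length = end - start + 1
    if length <= 0 or length % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return length // 3


# Coding-sequence range quoted in the feature table of SEQ ID NO: 109.
print(codon_count(86, 835))
```

For the range 86 to 835 this gives 250 codons, which agrees with the 250 amino acids stated for SEQ ID NO: 110.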

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
(ix) FEATURE: 

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 1...25
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

Met Leu Gly Asn Val Lys Lys Thr Leu Phe Gly Val Leu Cys Leu Gly
-25 -20 -15 -10

Thr Leu Cys Leu Arg Gly Leu Met Ala Glu Pro Asp Ala Lys Glu Leu
-5 1 5

Val Asn Leu Gly Ile Glu Ser Ala Lys Lys Gln Asp Phe Ala Gln Ala
10 15 20

Lys Thr His Phe Glu Lys Ala Cys Glu Leu Lys Asn Gly Phe Gly Cys
25 30 35

Val Phe Leu Gly Ala Phe Tyr Glu Glu Gly Lys Gly Val Gly Lys Asp
40 45 50 55

Leu Lys Lys Ala Ile Gln Phe Tyr Thr Lys Gly Cys Glu Leu Asn Asp
60 65 70

Gly Tyr Gly Cys Asn Leu Leu Gly Asn Leu Tyr Tyr Asn Gly Gln Gly
75 80 85

Val Ser Lys Asp Ala Lys Lys Ala Ser Gln Tyr Tyr Ser Lys Ala Cys
90 95 100

Asp Leu Asn His Ala Glu Gly Cys Met Val Leu Gly Ser Leu His His
105 110 115

Tyr Gly Val Gly Thr Pro Lys Asp Leu Arg Lys Ala Leu Asp Leu Tyr
120 125 130 135

Glu Lys Ala Cys Asp Leu Lys Asp Ser Pro Gly Cys Ile Asn Ala Gly
140 145 150

Tyr Ile Tyr Ser Val Thr Lys Asn Phe Lys Glu Ala Ile Val Arg Tyr
155 160 165

Ser Lys Ala Cys Glu Leu Lys Asp Gly Arg Gly Cys Tyr Asn Leu Gly
170 175 180

Val Met Gln Tyr Asn Ala Gln Gly Thr Ala Lys Asp Glu Lys Gln Ala
185 190 195

Val Glu Asn Phe Lys Lys Gly Cys Lys Ser Ser Val Lys Glu Ala Cys
200 205 210 215

Asp Ala Leu Lys Glu Leu Lys Ile Glu Leu
220 225
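Because the listing uses three-letter residue codes, a quick way to check a transcribed stretch is to convert it to one-letter codes and count the residues. The sketch below is illustrative only: the names `THREE_TO_ONE` and `to_one_letter` are hypothetical, and the example string is the 25-residue signal sequence of SEQ ID NO: 110 (positions -25 to -1). Codes outside the table raise an error, which also flags character-recognition slips such as a misread `Gln` or `Ile`.

```python
# Standard three-letter to one-letter amino acid codes, plus Xaa for unknown.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
    "Xaa": "X",
}


def to_one_letter(listing: str) -> str:
    """Convert whitespace-separated three-letter codes to a one-letter string."""
    residues = listing.split()
    unknown = [r for r in residues if r not in THREE_TO_ONE]
    if unknown:
        raise ValueError(f"unrecognised residue codes: {unknown}")
    return "".join(THREE_TO_ONE[r] for r in residues)


# Signal sequence of SEQ ID NO: 110 as given in the listing.
signal = (
    "Met Leu Gly Asn Val Lys Lys Thr Leu Phe Gly Val Leu Cys Leu Gly "
    "Thr Leu Cys Leu Arg Gly Leu Met Ala"
)
one_letter = to_one_letter(signal)
print(one_letter, len(one_letter))
```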



(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1079 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 169... 834 
(D) OTHER INFORMATION: 



(A) NAME / KEY : Signal Sequence 

(B) LOCATION: 169.. .289 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CAAAAAAAAA AAAAAACAAT TTCAGTTTCT TATTAGCTAG GTTTGATTAA AATGAAAAGC 6 0 

TTTTATGTGT TTAAACTTCA TTGTCTTAAA ACTTTTAAGA GCAATTTTAA AATTCGTTGG 12 0 

CGTATAATAT CCGTTTTGAA TGAACTACTA AAAAAAGGGT TTTAAATA ATG GCT GAA 177 

Met Ala Glu 
-40 

AAT TCT TTC AAA AAT GTT TCC ACA CAA CCC AAA GTA TTT TTC TTA TTG 225
Asn Ser Phe Lys Asn Val Ser Thr Gln Pro Lys Val Phe Phe Leu Leu
-35 -30 -25 

CCA GCT AAA ACC CTG TTT CTT TTA GGA GGC GTT TTT AGC GCG TTT TTT 273 
Pro Ala Lys Thr Leu Phe Leu Leu Gly Gly Val Phe Ser Ala Phe Phe 
-20 -15 -10 

ATC CTT ATT GCT GGC TTG GTT TTT TTT GAT TAT GCT CAT TTG ATG GAC 321 

lie Leu lie Ala Gly Leu Val Phe Phe Asp Tyr Ala His Leu Met Asp 

-5 1 5 10

AAT GCC ATT TTT AAT TTT GCG CGT TCA ACC CCC TTT AAT TCC AGC CCT 3 69 

Asn Ala lie Phe Asn Phe Ala Arg Ser Thr Pro Phe Asn Ser Ser Pro 

15 20 25 

ATT TTA ACT CTA ATC CTC CAA AAT ATC GCT AAT TTA GGC TCT TCT CAA 417 

lie Leu Thr Leu lie Leu Gin Asn lie Ala Asn Leu Gly Ser Ser Gin 

30 35 40 

TTC GTG TTG CCT TTG AGT TTG TTG GTG GGG GTG TTT TTA AGC CTT TAT 4 65 

Phe Val Leu Pro Leu Ser Leu Leu Val Gly Val Phe Leu Ser Leu Tyr 
45 50 55 

CGC AGA AAC TTA GTG CTT GGG GTG TGG TTT GTG TTA AGC GTG ATC TTG 513 
Arg Arg Asn Leu Val Leu Gly Val Trp Phe Val Leu Ser Val lie Leu 
60 65 70 75 

TTT GAA GCC CTT TTA GAA TCT TTA AAA CAC CTT TTT GCA TAT TCC ATT 561 
Phe Glu Ala Leu Leu Glu Ser Leu Lys His Leu Phe Ala Tyr Ser lie 
80 85 90 

CAG TGG CTT TCG CGC AGC GCT AAT TTC CCT AAC GCT ACT GCG CTT TCT 6 09 

Gin Trp Leu Ser Arg Ser Ala Asn Phe Pro Asn Ala Thr Ala Leu Ser 
95 100 105 

TTA GTG CTA TTT TAT GGG TTG CTT ATT TTA TTG ATA CCC CAT TTA ATC 657 
Leu Val Leu Phe Tyr Gly Leu Leu lie Leu Leu lie Pro His Leu lie 
110 115 120 

ACG CAT CAA ACG CTT AAA AAT GTT CTT TTT TAT AGC TTA TTT GGT TTG 7 05 

Thr His Gin Thr Leu Lys Asn Val Leu Phe Tyr Ser Leu Phe Gly Leu 
125 130 135 

ATT TTT TTA ATA GGG TTA GCA CTG ATT GTT TTA GGG GTT TCT TTC AGT 75 3 

lie Phe Leu lie Gly Leu Ala Leu lie Val Leu Gly Val Ser Phe Ser 
140 145 150 155 

AGT GTT TTA GGA GGG TTT TGT TTA GGG GCG TTA GGG GCT TGT TTT TCC 8 01 

Ser Val Leu Gly Gly Phe Cys Leu Gly Ala Leu Gly Ala Cys Phe Ser 
160 165 170 

ATA GGG ATT TAT TTG AGC GTG TTT CAA AAG ATC TAAACGAACG GCTTAAAAGA 854
Ile Gly Ile Tyr Leu Ser Val Phe Gln Lys Ile
175 180 

ATGAAAATTT TATCAAGGTT TTAATATTGG ATTTAAAGGT ATTATTGCAA CGGATTGTTG 914
ATTTTTTCAT CAAGCTCAAT AAAAAGCAAA AAATCGCCCT GATTGCAGCT GGGGTTTTGA 974
TCACGGCTTT GCTTGTGTTT TTATTGCTCT ATCCCTTTAA AGAAAAAGAC TACACGCAAG 1034
GGGGTTATGG GGTTTTATTT GAAGGTTTAG ACTCTAGCGA TAACG 1079



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
(ix) FEATURE: 



(A) NAME/KEY: Signal Sequence

(B) LOCATION: 1...40
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

Met Ala Glu Asn Ser Phe Lys Asn Val Ser Thr Gln Pro Lys Val Phe
-40 -35 -30 -25

Phe Leu Leu Pro Ala Lys Thr Leu Phe Leu Leu Gly Gly Val Phe Ser
-20 -15 -10

Ala Phe Phe Ile Leu Ile Ala Gly Leu Val Phe Phe Asp Tyr Ala His
-5 1 5

Leu Met Asp Asn Ala Ile Phe Asn Phe Ala Arg Ser Thr Pro Phe Asn
10 15 20

Ser Ser Pro Ile Leu Thr Leu Ile Leu Gln Asn Ile Ala Asn Leu Gly
25 30 35 40

Ser Ser Gln Phe Val Leu Pro Leu Ser Leu Leu Val Gly Val Phe Leu
45 50 55

Ser Leu Tyr Arg Arg Asn Leu Val Leu Gly Val Trp Phe Val Leu Ser
60 65 70

Val Ile Leu Phe Glu Ala Leu Leu Glu Ser Leu Lys His Leu Phe Ala
75 80 85

Tyr Ser Ile Gln Trp Leu Ser Arg Ser Ala Asn Phe Pro Asn Ala Thr
90 95 100

Ala Leu Ser Leu Val Leu Phe Tyr Gly Leu Leu Ile Leu Leu Ile Pro
105 110 115 120

His Leu Ile Thr His Gln Thr Leu Lys Asn Val Leu Phe Tyr Ser Leu
125 130 135

Phe Gly Leu Ile Phe Leu Ile Gly Leu Ala Leu Ile Val Leu Gly Val
140 145 150

Ser Phe Ser Ser Val Leu Gly Gly Phe Cys Leu Gly Ala Leu Gly Ala
155 160 165

Cys Phe Ser Ile Gly Ile Tyr Leu Ser Val Phe Gln Lys Ile
170 175 180



(2) INFORMATION FOR SEQ ID NO: 113:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME / KEY : Coding Sequence 

(B) LOCATION: 97 . . . 912 
(D) OTHER INFORMATION: 



(A) NAME /KEY : Signal Sequence 

(B) LOCATION: 97... 217 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

TTTTATTGAA TGTGTTGTAA TGTTTTTAAG GTATAATAAA CTCTTTTTAA GTCAAGCAAT 6 0 

AAAGTTTGCA ACCTGATGAG AGTAATAATA GAGTTT ATG CTG ATT TCA TTA AAA 114 

Met Leu lie Ser Leu Lys 
-40 -35 

ACA TTC CTA AAA ATA TTA TTG AAA ATA TTC CTA AAA ACC TTC CAA AAG 162 

Thr Phe Leu Lys lie Leu Leu Lys lie Phe Leu Lys Thr Phe Gin Lys 
-30 -25 -20 

ATT TGG GTA GTT TGC GTT ATT ATT TGG GGG TTA GGC TGT AGT TTT TTA 210 

lie Trp Val Val Cys Val lie lie Trp Gly Leu Gly Cys Ser Phe Leu 

-15 -10 -5 

AAC GCT AAC AGC ATT CAA TTA GAA GAA ACG CTC AGA CGA AGC CCT AAA 2 58 

Asn Ala Asn Ser lie Gin Leu Glu Glu Thr Leu Arg Arg Ser Pro Lys 
1 5 10

AAT CTT ATT TGG CAA CAC TTT AAA AAG AAG TTT AAA AAG AGC AAC ACG 3 06 

Asn Leu lie Trp Gin His Phe Lys Lys Lys Phe Lys Lys Ser Asn Thr 
15 20 25 30 

ATC CCT TAT GCC CCA AAT AGC CGT TGG AAA TAT TTA GGC ACG AGC ATA 3 54 

lie Pro Tyr Ala Pro Asn Ser Arg Trp Lys Tyr Leu Gly Thr Ser lie 
35 40 45 

GGG ATT TTA GGC GTG TCT TTG GTG ATA GGG ATT GTG GGG CTG TAT CTC 4 02 

Gly lie Leu Gly Val Ser Leu Val lie Gly lie Val Gly Leu Tyr Leu 

50 55 60 

ATG CCA GAG AGC GTA ACG AAT TGG GAT AAA GAA AAG TTT GGG ATC AAA 4 50 

Met Pro Glu Ser Val Thr Asn Trp Asp Lys Glu Lys Phe Gly lie Lys 
65 70 75

AGT TGG TTT GAA AAT GTC CGC ATG GGG CCA AAA CTG GAC AAT GAT AGT 498

Ser Trp Phe Glu Asn Val Arg Met Gly Pro Lys Leu Asp Asn Asp Ser 
80 85 90 

TTT ATT TTT AAT GAA ATT TTG CAC CCT TAT TTT GGG GCT ATG TAT TAT 54 6 

Phe lie Phe Asn Glu He Leu His Pro Tyr Phe Gly Ala Met Tyr Tyr 
95 100 105 110

ATG CAA CCG CGC ATG GCT GGA TTT AGC TGG ATG GCA TCA GCG TTT TTT 5 94 

Met Gin Pro Arg Met Ala Gly Phe Ser Trp Met Ala Ser Ala Phe Phe 
115 120 125 

TCT TTT ATC ACT TCC ACG CTT TTT TGG GAA TAT GGC TTG GAA GCG TTT 64 2 

Ser Phe He Thr Ser Thr Leu Phe Trp Glu Tyr Gly Leu Glu Ala Phe 
130 135 140 

GTG GAA GTG CCT AGC TGG CAG GAT TTA GTG ATC ACG CCT TTA TTA GGC 6 90 

Val Glu Val Pro Ser Trp Gin Asp Leu Val He Thr Pro Leu Leu Gly 
145 150 155 

TCC ATT TTA GGG GAG GGG TTT TAT CAG CTC ACG CGC TAT ATC CAA CGC 73 8 

Ser He Leu Gly Glu Gly Phe Tyr Gin Leu Thr Arg Tyr He Gin Arg 
160 165 170 

AAT GAA GGC AAG CTT TTT GGC TCT TTA TTT TTA GGG CGT TTA GTC ATC 7 86 

Asn Glu Gly Lys Leu Phe Gly Ser Leu Phe Leu Gly Arg Leu Val He 
175 180 185 190 

GCT CTT ATG GAT CCT ATC GGT TTT ATC ATT AGG GAT TTA GGA CTT GGG 8 34 

Ala Leu Met Asp Pro He Gly Phe He He Arg Asp Leu Gly Leu Gly 
195 200 205 

GAA GCT TTA GGG ATT TAT AAT AAA CAC GAA ATC CGT TCC AGC TTA AGC 8 82 

Glu Ala Leu Gly He Tyr Asn Lys His Glu He Arg Ser Ser Leu Ser 
210 215 220 

CCC AAT GGT TTG AAT TTG ACT TAC AAA TTT TAAGAGCTTA AAATTTAAGA AAA 935
Pro Asn Gly Leu Asn Leu Thr Tyr Lys Phe 
225 230 

TTATAAAGAG TTTTGATAGA ATACCTT 962 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
(v) FRAGMENT TYPE: internal 
(ix) FEATURE: 

(A) NAME/KEY: Signal Sequence
(B) LOCATION: 1...40
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

Met Leu Ile Ser Leu Lys Thr Phe Leu Lys Ile Leu Leu Lys Ile Phe
-40 -35 -30 -25

Leu Lys Thr Phe Gln Lys Ile Trp Val Val Cys Val Ile Ile Trp Gly
-20 -15 -10

Leu Gly Cys Ser Phe Leu Asn Ala Asn Ser Ile Gln Leu Glu Glu Thr
-5 1 5

Leu Arg Arg Ser Pro Lys Asn Leu Ile Trp Gln His Phe Lys Lys Lys
10 15 20

Phe Lys Lys Ser Asn Thr Ile Pro Tyr Ala Pro Asn Ser Arg Trp Lys
25 30 35 40

Tyr Leu Gly Thr Ser Ile Gly Ile Leu Gly Val Ser Leu Val Ile Gly
45 50 55

Ile Val Gly Leu Tyr Leu Met Pro Glu Ser Val Thr Asn Trp Asp Lys
60 65 70

Glu Lys Phe Gly Ile Lys Ser Trp Phe Glu Asn Val Arg Met Gly Pro
75 80 85

Lys Leu Asp Asn Asp Ser Phe Ile Phe Asn Glu Ile Leu His Pro Tyr
90 95 100

Phe Gly Ala Met Tyr Tyr Met Gln Pro Arg Met Ala Gly Phe Ser Trp
105 110 115 120

Met Ala Ser Ala Phe Phe Ser Phe Ile Thr Ser Thr Leu Phe Trp Glu
125 130 135

Tyr Gly Leu Glu Ala Phe Val Glu Val Pro Ser Trp Gln Asp Leu Val
140 145 150

Ile Thr Pro Leu Leu Gly Ser Ile Leu Gly Glu Gly Phe Tyr Gln Leu
155 160 165

Thr Arg Tyr Ile Gln Arg Asn Glu Gly Lys Leu Phe Gly Ser Leu Phe
170 175 180

Leu Gly Arg Leu Val Ile Ala Leu Met Asp Pro Ile Gly Phe Ile Ile
185 190 195 200

Arg Asp Leu Gly Leu Gly Glu Ala Leu Gly Ile Tyr Asn Lys His Glu
205 210 215

Ile Arg Ser Ser Leu Ser Pro Asn Gly Leu Asn Leu Thr Tyr Lys Phe
220 225 230



(2) INFORMATION FOR SEQ ID NO: 115:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1422 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence
(B) LOCATION: 216...1202
(D) OTHER INFORMATION:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 216...273
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

AAATTAAACG AGTTTGGTTT AGAGCCGTAT TTAGGGTTTT TGCACCCCCA TTTAACCAAT 60

GATTTTGAAA ATAACCCTAA TGAGCAATCA GCGCTCTTTG TCTTGCCCCT TTCAGCGGTT 120

AGCGCTCTTA ATGTGCATGC ACTCAAATTT GTGTTGTTGG AAGCGTTACC CTAAAACGCT 180

ATTTTTAAAA TAATCCATTA AAATAAAGGC GAGGA ATG AAA AGA TTT GTT TTG 233 

Met Lys Arg Phe Val Leu 
-15 

TTT TTA TTG TTC ATG TGC GTT TGC GTT CAA GCT TAC GCC GAG CAA GAT 2 81 
Phe Leu Leu Phe Met Cys Val Cys Val Gin Ala Tyr Ala Glu Gin Asp 
-10 -5 1 

TAC TTT TTT AGG GAT TTT AAA TCT AGA GAT TTG CCC CAA AAA CTC CAT 32 9 

Tyr Phe Phe Arg Asp Phe Lys Ser Arg Asp Leu Pro Gin Lys Leu His 
5 10 15 

CTT GAT AAA AAG CTC TCC CAA ACA ATA CAG CCA TGC ATG CAA CTT AAC 377 
Leu Asp Lys Lys Leu Ser Gin Thr lie Gin Pro Cys Met Gin Leu Asn 
20 25 30 35 

GCA TCA AAA CAC TAC ACT TCT ACC GGG GTT AGA GAG CCT GAT AAA TGC 42 5 

Ala Ser Lys His Tyr Thr Ser Thr Gly Val Arg Glu Pro Asp Lys Cys 
40 45 50 

ACA AAG AGT TTT AAA AAA TCC GCT CTC ATG TCC TAT GAC TTA GCG CTA 4 73 

Thr Lys Ser Phe Lys Lys Ser Ala Leu Met Ser Tyr Asp Leu Ala Leu 
55 60 65 

GGT TAT TTG GTG AGT AAG AAT AAG CAA TAC GGC TTA AAG GCT ATA GAA 521 
Gly Tyr Leu Val Ser Lys Asn Lys Gin Tyr Gly Leu Lys Ala lie Glu 
70 75 80 

ATT TTA AAC GCT TGG GCT AAA GAG CTT CAA AGC GTG GAT ACT TAT CAG 56 9 

He Leu Asn Ala Trp Ala Lys Glu Leu Gin Ser Val Asp Thr Tyr Gin 
85 90 95 

AGC GAG GAT AAT ATC AAT TTT TAC ATG CCT TAT ATG AAC ATG GCT TAT 617 
Ser Glu Asp Asn He Asn Phe Tyr Met Pro Tyr Met Asn Met Ala Tyr 
100 105 110 115

TGG TTT GTC AAA AAG GCG TTT CCT AGC CCA GAA TAT GAA GAT TTC ATT 66 5 

Trp Phe Val Lys Lys Ala Phe Pro Ser Pro Glu Tyr Glu Asp Phe He 
120 125 130

AAG CGG ATG CGC CAG TAT TCT CAA TCA GCT CTT AAC ACT AAC CAT GGG 713
Lys Arg Met Arg Gln Tyr Ser Gln Ser Ala Leu Asn Thr Asn His Gly
135 140 145 

GCG TGG GGC ATT CTT TTT GAT GTG AGT TCT GCG CTA GCG TTA GAC GAT 761 
Ala Trp Gly lie Leu Phe Asp Val Ser Ser Ala Leu Ala Leu Asp Asp 
150 155 160 

AAT GCC CTT TTG CAC AAT AGC GCT AAT CGG TGG CAG GAG TGG GTG TTT 8 09 

Asn Ala Leu Leu His Asn Ser Ala Asn Arg Trp Gin Glu Trp Val Phe 
165 170 175 

AAA GCC ATA GAT GAG AAT GGG GTT ATT GNT AGC GCG ATC ACT AGG AGC 857 
Lys Ala lie Asp Glu Asn Gly Val lie Xaa Ser Ala lie Thr Arg Ser 
180 185 190 195 

GAT ACG AGC GAT TAT CAT GGC GGC CCT ACA AAG GGC ATT AAG GGG ATA 905 
Asp Thr Ser Asp Tyr His Gly Gly Pro Thr Lys Gly lie Lys Gly He 
200 205 210 

GCT TAT ACC AAT TTC GCG CTT CTT GCG CTA ACC ATA TCA GGC GAA TTG 95 3 

Ala Tyr Thr Asn Phe Ala Leu Leu Ala Leu Thr He Ser Gly Glu Leu 
215 220 225 

CTT TTT GAG AAC GGG TAT GAT TTG TGG GGT AGT GGA GCT GGG AAA AGG 10 01 

Leu Phe Glu Asn Gly Tyr Asp Leu Trp Gly Ser Gly Ala Gly Lys Arg 
230 235 240 

CTC TCT GTG GCG TAT AAC AAA GTT GCA ACA TGG ATT TTA AAC CCT GAA 104 9 

Leu Ser Val Ala Tyr Asn Lys Val Ala Thr Trp He Leu Asn Pro Glu 
245 250 255 

ACT TTC CCT TAT TTC CAG CCT AAC CTT ATC GGG GTG CAT AAC AAC GCC 10 97 

Thr Phe Pro Tyr Phe Gin Pro Asn Leu He Gly Val His Asn Asn Ala 
260 265 270 275 

TAT TTC ATT ATT TTA GCC AAG CAT TAT TCT AGC CCT AGT GCA AAT GAG 114 5 

Tyr Phe He He Leu Ala Lys His Tyr Ser Ser Pro Ser Ala Asn Glu 
280 285 290 

CTT TTA AAG CAA GGC GAT TTA CAC GAA GAT GGT TTC AGG CTG AAA CTC 119 3 

Leu Leu Lys Gin Gly Asp Leu His Glu Asp Gly Phe Arg Leu Lys Leu 
295 300 305 

CGA TCG CCA TGAATTTTTC TGTATCCAAG GTTAGCCTTA AGGATGGCCA TGCGCTTTA 1251
Arg Ser Pro 
310 

ACCTTTTGAT GAATGGTTCA GAAAGTTTGT TTCAGTCAGC ATTATTTACA AAAAGAGTTT 1311

AAAATAAACG CAATTGTATC TCTTGAGTCG TCTTTAGAGT GCAAATGATT ATCAAAATGA 1371

ATCGTTTTAG TTGTAAGCGT GCTTATTTAC ACTAAAATAA TAAGCGTTAT T 1422
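SEQ ID NO: 115 contains the codon GNT, which the listing translates as Xaa because the undetermined base N leaves the residue ambiguous. The sketch below shows one way to reproduce that behaviour. It assumes the standard genetic code, and the names `CODON_TABLE` and `translate` are illustrative rather than taken from the listing. As a simplification, any codon not made up solely of A, C, G and T is reported as `X`, even where the ambiguity would not change the residue.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA to one-letter residues; unresolved codons become 'X'."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Six codons around the ambiguous position in SEQ ID NO: 115.
print(translate("GGG GTT ATT GNT AGC GCG"))
```

The printed string is `GVIXSA`, corresponding to Gly Val Ile Xaa Ser Ala in the listing.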

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 329 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
(v) FRAGMENT TYPE: internal 
(ix) FEATURE: 

(A) NAME /KEY : Signal Sequence 

(B) LOCATION: 1 ... 19 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Met Lys Arg Phe Val Leu Phe Leu Leu Phe Met Cys Val Cys Val Gln
-15 -10 -5

Ala Tyr Ala Glu Gln Asp Tyr Phe Phe Arg Asp Phe Lys Ser Arg Asp
1 5 10

Leu Pro Gln Lys Leu His Leu Asp Lys Lys Leu Ser Gln Thr Ile Gln
15 20 25

Pro Cys Met Gln Leu Asn Ala Ser Lys His Tyr Thr Ser Thr Gly Val
30 35 40 45

Arg Glu Pro Asp Lys Cys Thr Lys Ser Phe Lys Lys Ser Ala Leu Met
50 55 60

Ser Tyr Asp Leu Ala Leu Gly Tyr Leu Val Ser Lys Asn Lys Gln Tyr
65 70 75

Gly Leu Lys Ala Ile Glu Ile Leu Asn Ala Trp Ala Lys Glu Leu Gln
80 85 90

Ser Val Asp Thr Tyr Gln Ser Glu Asp Asn Ile Asn Phe Tyr Met Pro
95 100 105

Tyr Met Asn Met Ala Tyr Trp Phe Val Lys Lys Ala Phe Pro Ser Pro
110 115 120 125

Glu Tyr Glu Asp Phe Ile Lys Arg Met Arg Gln Tyr Ser Gln Ser Ala
130 135 140

Leu Asn Thr Asn His Gly Ala Trp Gly Ile Leu Phe Asp Val Ser Ser
145 150 155

Ala Leu Ala Leu Asp Asp Asn Ala Leu Leu His Asn Ser Ala Asn Arg
160 165 170

Trp Gln Glu Trp Val Phe Lys Ala Ile Asp Glu Asn Gly Val Ile Xaa
175 180 185

Ser Ala Ile Thr Arg Ser Asp Thr Ser Asp Tyr His Gly Gly Pro Thr
190 195 200 205

Lys Gly Ile Lys Gly Ile Ala Tyr Thr Asn Phe Ala Leu Leu Ala Leu
210 215 220

Thr Ile Ser Gly Glu Leu Leu Phe Glu Asn Gly Tyr Asp Leu Trp Gly
225 230 235

Ser Gly Ala Gly Lys Arg Leu Ser Val Ala Tyr Asn Lys Val Ala Thr
240 245 250

Trp Ile Leu Asn Pro Glu Thr Phe Pro Tyr Phe Gln Pro Asn Leu Ile
255 260 265

Gly Val His Asn Asn Ala Tyr Phe Ile Ile Leu Ala Lys His Tyr Ser
270 275 280 285

Ser Pro Ser Ala Asn Glu Leu Leu Lys Gln Gly Asp Leu His Glu Asp
290 295 300

Gly Phe Arg Leu Lys Leu Arg Ser Pro
305 310
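The position numbers printed under SEQ ID NO: 116 appear to follow a signed scheme: the 19 signal-sequence residues are numbered -19 to -1, zero is skipped, and the mature polypeptide runs from 1 to 310. The sketch below illustrates that mapping under those assumptions. The function name `listing_position` is hypothetical and the scheme is inferred from the numbers in the listing, not from a stated rule.

```python
def listing_position(index: int, signal_length: int) -> int:
    """Map a 0-based residue index to the signed position used in the listing.

    Signal-sequence residues get negative positions ending at -1, and the
    first mature residue is position 1 (there is no position 0).
    """
    if index < 0 or signal_length < 0:
        raise ValueError("index and signal_length must be non-negative")
    offset = index - signal_length
    return offset if offset < 0 else offset + 1


# SEQ ID NO: 116: 329 residues in total, 19 of them in the signal sequence.
print(listing_position(0, 19), listing_position(19, 19), listing_position(328, 19))
```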



(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 157... 987 
(D) OTHER INFORMATION: 



(A) NAME/ KEY : Signal Sequence 

(B) LOCATION: 157... 226 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

AGCGGTAAAA TCGCTGAAGA AAACAACGCT AAAGAATTTT TTAACCACCC GAAATCTCAA 60
AGAGCGCAAA AATTTTTAGA AACTTTCCAT TTTTTAGGGA GCTGTTAAAT AAAGTTTGCT 120

AAAAAGATGA TTCTAATTTC AAAAAAAGGT GTTTTT ATG AAA ACA AAC GGG CTT 174 

Met Lys Thr Asn Gly Leu 
-20 

TTT AAA ATG TGG GGG CTG TTT TTA GTT TTA ATC GCT TTA GTC TTT AAT 222 
Phe Lys Met Trp Gly Leu Phe Leu Val Leu Ile Ala Leu Val Phe Asn
-15 -10 -5 

GCA TGT TCT GAT AGC CAT AAA GAA AAA AAG GAC GCT TTA GAA GTC ATT 270 
Ala Cys Ser Asp Ser His Lys Glu Lys Lys Asp Ala Leu Glu Val lie 
1 5 10 15

AAA CAA AGA GGG GTT TTA AAA GTG GGG GTT TTT AGC GAT AAG CCT CCT 318 
Lys Gin Arg Gly Val Leu Lys Val Gly Val Phe Ser Asp Lys Pro Pro 
20 25 30 

TTT GGC TCT GTG GAT TCT AAA GGG AAA TAT CAA GGC TAT GAT GTA GTT 3 66 

Phe Gly Ser Val Asp Ser Lys Gly Lys Tyr Gin Gly Tyr Asp Val Val 
35 40 45 

ATT GCT AAA CGC ATG GCT CTT GAT TTA TTG GGC GAT GAA AAT AAG ATT 414 
lie Ala Lys Arg Met Ala Leu Asp Leu Leu Gly Asp Glu Asn Lys lie 
50 55 60

GAG TTT ATT CCT GTA GAA GCT TCA GCT AGG GTG GAA TTT TTA AAA GCC 462

Glu Phe He Pro Val Glu Ala Ser Ala Arg Val Glu Phe Leu Lys Ala 
65 70 75 

AAT AAA GTG GAT ATT ATC ATG GCT AAT TTC ACG CGC ACT AAA GAA AGA 510 
Asn Lys Val Asp He He Met Ala Asn Phe Thr Arg Thr Lys Glu Arg 
80 85 90 95 

GAA AAA GTC GTG GAT TTC GCT AAG CCG TAT ATG AAA GTC GCT TTA GGG 558 
Glu Lys Val Val Asp Phe Ala Lys Pro Tyr Met Lys Val Ala Leu Gly 
100 105 HO 

GTG GTT TCT AAA GAT GGG GTC ATT AAA AAT ATA GAA GAG TTG AAA GAT 606 
Val Val Ser Lys Asp Gly Val He Lys Asn lie Glu Glu Leu Lys Asp 
115 120 125 

AAA GAG TTG ATT GTG AAT AAA GGC ACG AC A GCG GAT TTT TAT TTC ACT 654 
Lys Glu Leu He Val Asn Lys Gly Thr Thr Ala Asp Phe Tyr Phe Thr 
130 135 140 

AAA AAT TAC CCC AAT ATC AAG CTT TTG AAA TTT GAG CAA AAT ACA GAG 7 02 

Lys Asn Tyr Pro Asn He Lys Leu Leu Lys Phe Glu Gin Asn Thr Glu 
145 150 155 

ACT TTT TTA GCC CTT TTA AAC AAT AAG GCT ACC GCT CTA GCC CAT GAC 750 
Thr Phe Leu Ala Leu Leu Asn Asn Lys Ala Thr Ala Leu Ala His Asp 
160 165 170 175 

AAC ACT TTA TTG CTC GCT TGG ACG AAA CAA CAC CCT GAA TTT AAA TTA 7 98 

Asn Thr Leu Leu Leu Ala Trp Thr Lys Gin His Pro Glu Phe Lys Leu 
180 185 190 

GGC ATT ACA AGC CTT GGC GAT AAG GAT GTG ATC GCT CCA GCG ATT AAA 84 6 

Gly He Thr Ser Leu Gly Asp Lys Asp Val lie Ala Pro Ala He Lys 
195 200 205 

AAA GGC AAC CCC AAG CTT TTA GAA TGG TTG AAT AAC GAA ATA GAT TCC 8 94 

Lys Gly Asn Pro Lys Leu Leu Glu Trp Leu Asn Asn Glu He Asp Ser 
210 215 220 

CTC ATT TCT AGC GAC TTC TTA AAA GAA GCT TAT CAA GAG ACT TTA GCA 94 2 

Leu lie Ser Ser Asp Phe Leu Lys Glu Ala Tyr Gin Glu Thr Leu Ala 
225 230 235 

CCT GTT TAT GGC GAT GAA ATC AAA CCG GAA GAA ATT ATT TTT GAA TGATT 9 92 
Pro Val Tyr Gly Asp Glu He Lys Pro Glu Glu He He Phe Glu 
240 245 250 

TCTTTAGGCT TTGAATTCTT GACAGGGTGC GTTTTTATTG CTAAATTAGC AATTTTGTGA 10 52 

TCTTTTTGTT TTTCATTTTG AGATATAT 1080 

(2) INFORMATION FOR SEQ ID NO: 118:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 277 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal
(ix) FEATURE:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 1...23

(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Met Lys Thr Asn Gly Leu Phe Lys Met Trp Gly Leu Phe Leu Val Leu
-20 -15 -10

Ile Ala Leu Val Phe Asn Ala Cys Ser Asp Ser His Lys Glu Lys Lys
-5 1 5

Asp Ala Leu Glu Val Ile Lys Gln Arg Gly Val Leu Lys Val Gly Val
10 15 20 25

Phe Ser Asp Lys Pro Pro Phe Gly Ser Val Asp Ser Lys Gly Lys Tyr
30 35 40

Gln Gly Tyr Asp Val Val Ile Ala Lys Arg Met Ala Leu Asp Leu Leu
45 50 55

Gly Asp Glu Asn Lys Ile Glu Phe Ile Pro Val Glu Ala Ser Ala Arg
60 65 70

Val Glu Phe Leu Lys Ala Asn Lys Val Asp Ile Ile Met Ala Asn Phe
75 80 85

Thr Arg Thr Lys Glu Arg Glu Lys Val Val Asp Phe Ala Lys Pro Tyr
90 95 100 105

Met Lys Val Ala Leu Gly Val Val Ser Lys Asp Gly Val Ile Lys Asn
110 115 120

Ile Glu Glu Leu Lys Asp Lys Glu Leu Ile Val Asn Lys Gly Thr Thr
125 130 135

Ala Asp Phe Tyr Phe Thr Lys Asn Tyr Pro Asn Ile Lys Leu Leu Lys
140 145 150

Phe Glu Gln Asn Thr Glu Thr Phe Leu Ala Leu Leu Asn Asn Lys Ala
155 160 165

Thr Ala Leu Ala His Asp Asn Thr Leu Leu Leu Ala Trp Thr Lys Gln
170 175 180 185

His Pro Glu Phe Lys Leu Gly Ile Thr Ser Leu Gly Asp Lys Asp Val
190 195 200

Ile Ala Pro Ala Ile Lys Lys Gly Asn Pro Lys Leu Leu Glu Trp Leu
205 210 215

Asn Asn Glu Ile Asp Ser Leu Ile Ser Ser Asp Phe Leu Lys Glu Ala
220 225 230

Tyr Gln Glu Thr Leu Ala Pro Val Tyr Gly Asp Glu Ile Lys Pro Glu
235 240 245

Glu Ile Ile Phe Glu
250

(2) INFORMATION FOR SEQ ID NO: 119:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1114 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 37...1050
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

CGAGCTATCA CAACAAATCA ATTTGTAGGA ACAAGC ATG TTT TTT AAA ACT TAT 54
Met Phe Phe Lys Thr Tyr
1 5

CAA AAA TTA TTG GGT GCG AGC TGT TTG ACG TTG TAT TTA GCG GGC TGT 102
Gln Lys Leu Leu Gly Ala Ser Cys Leu Thr Leu Tyr Leu Ala Gly Cys
10 15 20

GGG AGT GAT AGT AGC GAG CCA TTG GTG GGA ATT GAA AAA AAT AGC TTC 150
Gly Ser Asp Ser Ser Glu Pro Leu Val Gly Ile Glu Lys Asn Ser Phe
25 30 35

AAT TCT ACC GTG AAA ATC ATT TCT AAA ACC GAC AAC ATA GAA ATC CAA 198
Asn Ser Thr Val Lys Ile Ile Ser Lys Thr Asp Asn Ile Glu Ile Gln
40 45 50

GAC TTG AAG CTC AAT CGT GGC AAT TGT GAG CAT GAT CAA AAT TTC TTG 246
Asp Leu Lys Leu Asn Arg Gly Asn Cys Glu His Asp Gln Asn Phe Leu
55 60 65 70

GTA AAG TTA ATC CAA GAA ACA GCC AAT ACA TAC CTG TTT GCA TCA GAA 294
Val Lys Leu Ile Gln Glu Thr Ala Asn Thr Tyr Leu Phe Ala Ser Glu
75 80 85

AAA GAA AAA GCG ATC AAA AAC CAC CAA GCA AAA ATC GCA AGA CTT CAA 342
Lys Glu Lys Ala Ile Lys Asn His Gln Ala Lys Ile Ala Arg Leu Gln
90 95 100

AAA GAT TTA GAA GAA CTC ACA CAG CAT GTG CAA CAA TCC AAT AAT CTT 390
Lys Asp Leu Glu Glu Leu Thr Gln His Val Gln Gln Ser Asn Asn Leu
105 110 115

GAT AAA TTG TTA GAA AAT GGA GGA CTA TTC GTT AGT GGC CAT GAT TAT 438
Asp Lys Leu Leu Glu Asn Gly Gly Leu Phe Val Ser Gly His Asp Tyr
120 125 130

AAA TAT ACA AAA GAT GAT AAC CCA ATA TAT GTT GTT AAG AGG ATG CTT 486
Lys Tyr Thr Lys Asp Asp Asn Pro Ile Tyr Val Val Lys Arg Met Leu
135 140 145 150

GAT AAC CTT GAT AGC TAT AAA TAT GAA TCA GAC GAC GTG CTA GAC GTG 534
Asp Asn Leu Asp Ser Tyr Lys Tyr Glu Ser Asp Asp Val Leu Asp Val
155 160 165

CCA TAT GAG AAG CTA TTG GAA ATA AGC ATT GCT ATT GAA GAC ACT AAA 582
Pro Tyr Glu Lys Leu Leu Glu Ile Ser Ile Ala Ile Glu Asp Thr Lys
170 175 180

AAC CCC AAA GAC TAC CCT TAT ATC AAC CTT AAA GAA CTC AAA AAA TTA 630
Asn Pro Lys Asp Tyr Pro Tyr Ile Asn Leu Lys Glu Leu Lys Lys Leu
185 190 195

ATA GAT AGT ATT ATT GAT GAT CAT GGT TAT ATG GCC GAT GGC TTT TTG 678
Ile Asp Ser Ile Ile Asp Asp His Gly Tyr Met Ala Asp Gly Phe Leu
200 205 210

AAT GAA TAT TCT AAT AGG GTA TCA AAA AAA GGT CTC CAA ATC CTT GCT 726
Asn Glu Tyr Ser Asn Arg Val Ser Lys Lys Gly Leu Gln Ile Leu Ala
215 220 225 230

AAA CTA AAA TCC ATG TGG CCT AGC GTA GGG AAA TTT TAT TTC GCC TCT 774
Lys Leu Lys Ser Met Trp Pro Ser Val Gly Lys Phe Tyr Phe Ala Ser
235 240 245

TTG AAA GAG GCT ATC CCA AGG CAT GCC AAA GAA GTT ACT GAC AAG ATG 822
Leu Lys Glu Ala Ile Pro Arg His Ala Lys Glu Val Thr Asp Lys Met
250 255 260

ATT AGC TCT GAA GAA AAA TCT ATC AAA GCC AAT CAA GTC AAA CTC ACT 870
Ile Ser Ser Glu Glu Lys Ser Ile Lys Ala Asn Gln Val Lys Leu Thr
265 270 275

GAA GCG AAG CAA GAT ATT GAC AAA ATG GAA AAA ATC ATT AAA GAT TTA 918
Glu Ala Lys Gln Asp Ile Asp Lys Met Glu Lys Ile Ile Lys Asp Leu
280 285 290

GAA AGC AAG AAA AAC ACC TTA TCA GTG TAT TTA AAA TTT GGA GAA AGT 966
Glu Ser Lys Lys Asn Thr Leu Ser Val Tyr Leu Lys Phe Gly Glu Ser
295 300 305 310

TTC ACA GCG CAT TAT AAG TGT CAA AAT CTC ATA GAA GTT GGA GTC AAA 1014
Phe Thr Ala His Tyr Lys Cys Gln Asn Leu Ile Glu Val Gly Val Lys
315 320 325

ACC GAT AAA GGC TCC TGG ACT TTC AAC TTT AAC AGA TAAATCAGGC AAATAT 1066
Thr Asp Lys Gly Ser Trp Thr Phe Asn Phe Asn Arg
330 335

GGACAATAGC ACAGACAGAG CAAAAATCCT TATAGAAGAG CTTAAAAT 1114

(2) INFORMATION FOR SEQ ID NO: 120:
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 338 amino acids






(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:

Met Phe Phe Lys Thr Tyr Gln Lys Leu Leu Gly Ala Ser Cys Leu Thr
1 5 10 15

Leu Tyr Leu Ala Gly Cys Gly Ser Asp Ser Ser Glu Pro Leu Val Gly
20 25 30

Ile Glu Lys Asn Ser Phe Asn Ser Thr Val Lys Ile Ile Ser Lys Thr
35 40 45

Asp Asn Ile Glu Ile Gln Asp Leu Lys Leu Asn Arg Gly Asn Cys Glu
50 55 60

His Asp Gln Asn Phe Leu Val Lys Leu Ile Gln Glu Thr Ala Asn Thr
65 70 75 80

Tyr Leu Phe Ala Ser Glu Lys Glu Lys Ala Ile Lys Asn His Gln Ala
85 90 95

Lys Ile Ala Arg Leu Gln Lys Asp Leu Glu Glu Leu Thr Gln His Val
100 105 110

Gln Gln Ser Asn Asn Leu Asp Lys Leu Leu Glu Asn Gly Gly Leu Phe
115 120 125

Val Ser Gly His Asp Tyr Lys Tyr Thr Lys Asp Asp Asn Pro Ile Tyr
130 135 140

Val Val Lys Arg Met Leu Asp Asn Leu Asp Ser Tyr Lys Tyr Glu Ser
145 150 155 160

Asp Asp Val Leu Asp Val Pro Tyr Glu Lys Leu Leu Glu Ile Ser Ile
165 170 175

Ala Ile Glu Asp Thr Lys Asn Pro Lys Asp Tyr Pro Tyr Ile Asn Leu
180 185 190

Lys Glu Leu Lys Lys Leu Ile Asp Ser Ile Ile Asp Asp His Gly Tyr
195 200 205

Met Ala Asp Gly Phe Leu Asn Glu Tyr Ser Asn Arg Val Ser Lys Lys
210 215 220

Gly Leu Gln Ile Leu Ala Lys Leu Lys Ser Met Trp Pro Ser Val Gly
225 230 235 240

Lys Phe Tyr Phe Ala Ser Leu Lys Glu Ala Ile Pro Arg His Ala Lys
245 250 255

Glu Val Thr Asp Lys Met Ile Ser Ser Glu Glu Lys Ser Ile Lys Ala
260 265 270

Asn Gln Val Lys Leu Thr Glu Ala Lys Gln Asp Ile Asp Lys Met Glu
275 280 285

Lys Ile Ile Lys Asp Leu Glu Ser Lys Lys Asn Thr Leu Ser Val Tyr
290 295 300

Leu Lys Phe Gly Glu Ser Phe Thr Ala His Tyr Lys Cys Gln Asn Leu
305 310 315 320

Ile Glu Val Gly Val Lys Thr Asp Lys Gly Ser Trp Thr Phe Asn Phe
325 330

Asn Arg



(2) INFORMATION FOR SEQ ID NO: 121: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 40...1026
(D) OTHER INFORMATION:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 40...99
(D) OTHER INFORMATION:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 100...1026
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

GGTTATACCG AAAAAACAAT ATGAAATCAA GGAGTTTGT ATG CAA CAG CGT CAT 54
Met Gln Gln Arg His
-20

TTA GGC CCT TTA AAA GTG GGT GCA TTA GCT CTA GGG TGC ATG GGC ATG 102
Leu Gly Pro Leu Lys Val Gly Ala Leu Ala Leu Gly Cys Met Gly Met
-15 -10 -5 1

ACT TAT GGG TAT GGG GAA GTC CAT GAT AAA AAG CAG ATG GTT AAA CTT 150
Thr Tyr Gly Tyr Gly Glu Val His Asp Lys Lys Gln Met Val Lys Leu
5 10 15

ATC CAT AAG GCT TTG GAA TTG GGT ATT AAC TTT TTT GAC ACT GCA GAG 198
Ile His Lys Ala Leu Glu Leu Gly Ile Asn Phe Phe Asp Thr Ala Glu
20 25 30

GCT TAT GGG GAA GAT AAT GAA AAG CTT TTA GCG AAG CGA TCA AGC CTT 246
Ala Tyr Gly Glu Asp Asn Glu Lys Leu Leu Ala Lys Arg Ser Ser Leu
35 40 45

ATT AAA GAC AAG GTT GTG GTA GCG AGC AAG TTT GGG ATT TAC TAC GCA 294
Ile Lys Asp Lys Val Val Val Ala Ser Lys Phe Gly Ile Tyr Tyr Ala
50 55 60 65

GAT CCT AAT GAC AAA TAC GCA ACC ATG TTT TTA GAC TCC AGT TCT AAC 342
Asp Pro Asn Asp Lys Tyr Ala Thr Met Phe Leu Asp Ser Ser Ser Asn
70 75 80

CGC ATT AAG AGT GCC ATT GAA GGG AGT TTG AAA CGC TTA AAA GTA GAA 390
Arg Ile Lys Ser Ala Ile Glu Gly Ser Leu Lys Arg Leu Lys Val Glu
85 90 95

TGC ATT GAT TTA TAC TAC CAA CAC CGC ATG GAT ACT AAC ACG CCC ATA 438
Cys Ile Asp Leu Tyr Tyr Gln His Arg Met Asp Thr Asn Thr Pro Ile
100 105 110

GAA GAA GTG GCA GAA GTT ATG CAA GCT CTT ATT AAA GAA GGA AAA ATT 486
Glu Glu Val Ala Glu Val Met Gln Ala Leu Ile Lys Glu Gly Lys Ile
115 120 125

AAA GCT TGG GGG ATG AGT GAG GCA GGG TTA TCT AGC ATC CAA AAA GCC 534
Lys Ala Trp Gly Met Ser Glu Ala Gly Leu Ser Ser Ile Gln Lys Ala
130 135 140 145

CAT CAA ATT TGC CCT TTA AGC GCG TTG CAG AGC GAA TAT TCC TTG TGG 582
His Gln Ile Cys Pro Leu Ser Ala Leu Gln Ser Glu Tyr Ser Leu Trp
150 155 160

TGG CGC GAA CCT GAA AAA GAG ATT TTA GGT TTT TTA GAA AAA GAA AAA 630
Trp Arg Glu Pro Glu Lys Glu Ile Leu Gly Phe Leu Glu Lys Glu Lys
165 170 175

ATT GGC TTT GTC GCT TTT TCG CCT TTG GGT AAG GGG TTT TTA GGC GCG 678
Ile Gly Phe Val Ala Phe Ser Pro Leu Gly Lys Gly Phe Leu Gly Ala
180 185 190

AAA TTT GAA AAA AAT GCT ACC TTC GCT AGT GAA GAT TTT AGA AGC GTT 726
Lys Phe Glu Lys Asn Ala Thr Phe Ala Ser Glu Asp Phe Arg Ser Val
195 200 205

TCT CCT AGG TTT AAT CAA GAA AAT CTA GCC AAA AAT TAC GTC TTG GTG 774
Ser Pro Arg Phe Asn Gln Glu Asn Leu Ala Lys Asn Tyr Val Leu Val
210 215 220 225

GAA TTA ATC CAA GAT CAT GCA CAC GCT AAA GGC GTT ACA CCA GCC CAA 822
Glu Leu Ile Gln Asp His Ala His Ala Lys Gly Val Thr Pro Ala Gln
230 235 240

CTG GCT CTC TCG TGG ATT TTG CAC ACG CAA AAA ATC ATT GTC CCT CTC 870
Leu Ala Leu Ser Trp Ile Leu His Thr Gln Lys Ile Ile Val Pro Leu
245 250 255

TTT GGC ACC ACC AAA GAA TCC AGG CTC ATA GAA AAT ATA GGG GCT TTG 918
Phe Gly Thr Thr Lys Glu Ser Arg Leu Ile Glu Asn Ile Gly Ala Leu
260 265 270

CAG GTT TCT TGG AGT CAA AAA GAA TTG GAG ATT TTT CAA AAA GAA TTG 966
Gln Val Ser Trp Ser Gln Lys Glu Leu Glu Ile Phe Gln Lys Glu Leu
275 280 285

ACT GCA ATC AAA ATA GAA GGG GCC CGC TAC CCT GAA AGA ATC AAT GAA 1014
Thr Ala Ile Lys Ile Glu Gly Ala Arg Tyr Pro Glu Arg Ile Asn Glu
290 295 300 305

ATG GTG AAT CAA TAAAAGTATT GGGTATTTAT AATTGCATTG GCTCTTTTAA AAGAG 1071
Met Val Asn Gln

ATTGAGCGTT ATTTCCTGTT TGTCAGTGTG 1101
(2) INFORMATION FOR SEQ ID NO: 122:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 329 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

Met Gln Gln Arg His Leu Gly Pro Leu Lys Val Gly Ala Leu Ala Leu
-20 -15 -10 -5

Gly Cys Met Gly Met Thr Tyr Gly Tyr Gly Glu Val His Asp Lys Lys
1 5 10

Gln Met Val Lys Leu Ile His Lys Ala Leu Glu Leu Gly Ile Asn Phe
15 20 25

Phe Asp Thr Ala Glu Ala Tyr Gly Glu Asp Asn Glu Lys Leu Leu Ala
30 35 40

Lys Arg Ser Ser Leu Ile Lys Asp Lys Val Val Val Ala Ser Lys Phe
45 50 55 60

Gly Ile Tyr Tyr Ala Asp Pro Asn Asp Lys Tyr Ala Thr Met Phe Leu
65 70 75

Asp Ser Ser Ser Asn Arg Ile Lys Ser Ala Ile Glu Gly Ser Leu Lys
80 85 90

Arg Leu Lys Val Glu Cys Ile Asp Leu Tyr Tyr Gln His Arg Met Asp
95 100 105

Thr Asn Thr Pro Ile Glu Glu Val Ala Glu Val Met Gln Ala Leu Ile
110 115 120

Lys Glu Gly Lys Ile Lys Ala Trp Gly Met Ser Glu Ala Gly Leu Ser
125 130 135 140

Ser Ile Gln Lys Ala His Gln Ile Cys Pro Leu Ser Ala Leu Gln Ser
145 150 155

Glu Tyr Ser Leu Trp Trp Arg Glu Pro Glu Lys Glu Ile Leu Gly Phe
160 165 170

Leu Glu Lys Glu Lys Ile Gly Phe Val Ala Phe Ser Pro Leu Gly Lys
175 180 185

Gly Phe Leu Gly Ala Lys Phe Glu Lys Asn Ala Thr Phe Ala Ser Glu
190 195 200

Asp Phe Arg Ser Val Ser Pro Arg Phe Asn Gln Glu Asn Leu Ala Lys
205 210 215 220

Asn Tyr Val Leu Val Glu Leu Ile Gln Asp His Ala His Ala Lys Gly
225 230 235

Val Thr Pro Ala Gln Leu Ala Leu Ser Trp Ile Leu His Thr Gln Lys
240 245 250

Ile Ile Val Pro Leu Phe Gly Thr Thr Lys Glu Ser Arg Leu Ile Glu
255 260 265

Asn Ile Gly Ala Leu Gln Val Ser Trp Ser Gln Lys Glu Leu Glu Ile
270 275 280

Phe Gln Lys Glu Leu Thr Ala Ile Lys Ile Glu Gly Ala Arg Tyr Pro
285 290 295 300

Glu Arg Ile Asn Glu Met Val Asn Gln
305



(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 955 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 126...806
(D) OTHER INFORMATION:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 126...237
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 

GTCAGCCTTT AAAGGTTTCA TTATAGCAAA GAATATTATT TTTTTATTCC TTGCGTTTTC 60

TGTGCGTTTG TGGGGCAAAT AAGATATAAT CGCCTTTTTA AAATTCATTT TTTAAAGGGG 120

TTTGA ATG GTA TTT GAC AGA ACA ATC AGC GTA AGA GAA AAA AAA GCG GCT 170
Met Val Phe Asp Arg Thr Ile Ser Val Arg Glu Lys Lys Ala Ala
-35 -30 -25

AAA ACG CTT GGG ATT ATT GGG ATC GTC TTT TTT ATT TTG TTT GGC ATC 218
Lys Thr Leu Gly Ile Ile Gly Ile Val Phe Phe Ile Leu Phe Gly Ile
-20 -15 -10

GTG ATA AGC GGG GTG GCT TTT CAA AAA GAG TGG GTG CAA CAA TTG GAT 266
Val Ile Ser Gly Val Ala Phe Gln Lys Glu Trp Val Gln Gln Leu Asp
-5 1 5 10

TTA TTT TTT ATA GAC TTG ATC CGC AAC CCT GCC CCC ATT CAA AAA AGC 314
Leu Phe Phe Ile Asp Leu Ile Arg Asn Pro Ala Pro Ile Gln Lys Ser
15 20 25

GCG TGG CTT TCT TTC GTG TTT TTT AGC ACT TGG TTT GCA CAA AGC AAG 362
Ala Trp Leu Ser Phe Val Phe Phe Ser Thr Trp Phe Ala Gln Ser Lys
30 35 40

CTC ACC ACT CCT ATA GCC TTA CTC ATT GGC TTG TGG TTT GGG TTT CAA 410
Leu Thr Thr Pro Ile Ala Leu Leu Ile Gly Leu Trp Phe Gly Phe Gln
45 50 55

AAA CGC ATC GCT TTG GGG GTG TGG TTT TTC TTT AGC ATC TTA TTA GGT 458
Lys Arg Ile Ala Leu Gly Val Trp Phe Phe Phe Ser Ile Leu Leu Gly
60 65 70

GAA TTC ACC TTA AAA TCC CTT AAG CTT TTA GTG GCG CGC CCA CGG CCT 506
Glu Phe Thr Leu Lys Ser Leu Lys Leu Leu Val Ala Arg Pro Arg Pro
75 80 85 90

GTA ACC AAT GGC GAA TTG GTT TTC GCG CAT GGC TTT AGT TTC CCT AGC 554
Val Thr Asn Gly Glu Leu Val Phe Ala His Gly Phe Ser Phe Pro Ser
95 100 105

GGG CAT GCT TTG GCT TCA GCG CTT TTT TAC GGC TCT TTG GCG TTG TTG 602
Gly His Ala Leu Ala Ser Ala Leu Phe Tyr Gly Ser Leu Ala Leu Leu
110 115 120

TTA TGC TAT TCT AAC GCC AAC AAT CGC ATT AAA ACG ATT ATT GCT GTG 650
Leu Cys Tyr Ser Asn Ala Asn Asn Arg Ile Lys Thr Ile Ile Ala Val
125 130 135

GTT TTG CTT TTT TGG ATT TTT TTA ATG GCG TAT GAT AGG GTT TAT TTA 698
Val Leu Leu Phe Trp Ile Phe Leu Met Ala Tyr Asp Arg Val Tyr Leu
140 145 150

GGG GTG CAT TAC CCT AGC GAT GTT TTA GGA GGG TTT TTA TTA GGG ATT 746
Gly Val His Tyr Pro Ser Asp Val Leu Gly Gly Phe Leu Leu Gly Ile
155 160 165 170

GCT TGG TCG TGC TGC TCT TTA GCG CTT TAT TTA GGG TTT TTG AAA CGC 794
Ala Trp Ser Cys Cys Ser Leu Ala Leu Tyr Leu Gly Phe Leu Lys Arg
175 180 185

CCT TAT AAT CAA TAAAGGCTTT ATTTAACCAA ACACTGACAA CTAAAATTTT TAAAA 851
Pro Tyr Asn Gln
190

TTCTATTTTT TGATAAAACT CATTCTCTTA AGGGGATAGG GGGTATTTTG CGATAATACC 911

CCCTTAACCC CCTTAAGAAA CCCCCTAACC CCCAAGACCG CTTT 955

(2) INFORMATION FOR SEQ ID NO:124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal
(ix) FEATURE:
(A) NAME/KEY: Signal Sequence

(B) LOCATION: 1...37
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Met Val Phe Asp Arg Thr Ile Ser Val Arg Glu Lys Lys Ala Ala Lys
-35 -30 -25

Thr Leu Gly Ile Ile Gly Ile Val Phe Phe Ile Leu Phe Gly Ile Val
-20 -15 -10

Ile Ser Gly Val Ala Phe Gln Lys Glu Trp Val Gln Gln Leu Asp Leu
-5 1 5 10

Phe Phe Ile Asp Leu Ile Arg Asn Pro Ala Pro Ile Gln Lys Ser Ala
15 20 25

Trp Leu Ser Phe Val Phe Phe Ser Thr Trp Phe Ala Gln Ser Lys Leu
30 35 40

Thr Thr Pro Ile Ala Leu Leu Ile Gly Leu Trp Phe Gly Phe Gln Lys
45 50 55

Arg Ile Ala Leu Gly Val Trp Phe Phe Phe Ser Ile Leu Leu Gly Glu
60 65 70 75

Phe Thr Leu Lys Ser Leu Lys Leu Leu Val Ala Arg Pro Arg Pro Val
80 85 90

Thr Asn Gly Glu Leu Val Phe Ala His Gly Phe Ser Phe Pro Ser Gly
95 100 105

His Ala Leu Ala Ser Ala Leu Phe Tyr Gly Ser Leu Ala Leu Leu Leu
110 115 120

Cys Tyr Ser Asn Ala Asn Asn Arg Ile Lys Thr Ile Ile Ala Val Val
125 130 135

Leu Leu Phe Trp Ile Phe Leu Met Ala Tyr Asp Arg Val Tyr Leu Gly
140 145 150 155

Val His Tyr Pro Ser Asp Val Leu Gly Gly Phe Leu Leu Gly Ile Ala
160 165 170

Trp Ser Cys Cys Ser Leu Ala Leu Tyr Leu Gly Phe Leu Lys Arg Pro
175 180 185

Tyr Asn Gln
190



(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1183 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 91...1032
(D) OTHER INFORMATION:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 91...148
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

CTTAAAAGAA ACTTCGCAAA CCTTTTTATA TTATTTTAAA AGCACTAATA TTTATTATAT 60

TAGTTACAAC TATTTATTGT AAAGGCTAAA ATG TTG AAA TTT AAA TAT GGT TTG 114
Met Leu Lys Phe Lys Tyr Gly Leu
-15

ATT TAT ATC GCG CTC ATA CTA GGA CTT CAA GCG ACA GAT TAT GAC AAT 162
Ile Tyr Ile Ala Leu Ile Leu Gly Leu Gln Ala Thr Asp Tyr Asp Asn
-10 -5 1 5

TTA GAA GAA GAA AAC CAA CAA TTA GAT GAA AAA ATA AAC CAT TTA AAG 210
Leu Glu Glu Glu Asn Gln Gln Leu Asp Glu Lys Ile Asn His Leu Lys
10 15 20

CAA CAG CTC ACC GAA AAA GGG GTT TCG CCC AAA GAG ATG GAT AAG GAT 258
Gln Gln Leu Thr Glu Lys Gly Val Ser Pro Lys Glu Met Asp Lys Asp
25 30 35

AAG TTT GAA GAA GAA TAC ATC AAT CGA TCT TAT CCT AAA ATT TCT TCC 306
Lys Phe Glu Glu Glu Tyr Ile Asn Arg Ser Tyr Pro Lys Ile Ser Ser
40 45 50

AAG AAA AAA GAG AAA TTG CTC AAA TCT TTT TCC ATA GCC GAT GAT AAG 354
Lys Lys Lys Glu Lys Leu Leu Lys Ser Phe Ser Ile Ala Asp Asp Lys
55 60 65

AGT GGG GTT TTT TTA GGG GGT GGG TAT GCT TAT GGG GAA CTT AAC TTG 402
Ser Gly Val Phe Leu Gly Gly Gly Tyr Ala Tyr Gly Glu Leu Asn Leu
70 75 80 85

TCT TAT CAA GGG GAA ATG TTA GAC AGA TAC GGC GCG AAT GCC CCT AGC 450
Ser Tyr Gln Gly Glu Met Leu Asp Arg Tyr Gly Ala Asn Ala Pro Ser
90 95 100

GCG TTT AAA AAC AAT ATC AAT ATT AAC GCT CCT GTT TCT ATG ATT AGC 498
Ala Phe Lys Asn Asn Ile Asn Ile Asn Ala Pro Val Ser Met Ile Ser
105 110 115

GCT AAA TTT GGG TAT CAA AAA TAC TTT GTG TCT TAT TTT GGG ACA CGA 546
Ala Lys Phe Gly Tyr Gln Lys Tyr Phe Val Ser Tyr Phe Gly Thr Arg
120 125 130

TTT TAT GGG GAT TTA TTG CTT GGG GGT GGG GCA TTA AAA GAG GAT GCA 594
Phe Tyr Gly Asp Leu Leu Leu Gly Gly Gly Ala Leu Lys Glu Asp Ala
135 140 145

ATC AAG CAG CCT GTA GGC TCG TTT ATT TAT GTT TTA GGG GCT GTC AAT 642
Ile Lys Gln Pro Val Gly Ser Phe Ile Tyr Val Leu Gly Ala Val Asn
150 155 160 165

ACC GAT TTA TTG TTT GAT ATG CCT TTA GAT TTT AAA ACT AAA AAG CAT 690
Thr Asp Leu Leu Phe Asp Met Pro Leu Asp Phe Lys Thr Lys Lys His
170 175 180

TTT TTA GGC GTT TAT GCG GGT TTT GGG ATA GGG CTT ATG CTC TAT CAA 738
Phe Leu Gly Val Tyr Ala Gly Phe Gly Ile Gly Leu Met Leu Tyr Gln
185 190 195

GAC AGG CCT AAT CAA AAC GGG AGG AAT TTA GTA GTG GGG GGC TAT TCA 786
Asp Arg Pro Asn Gln Asn Gly Arg Asn Leu Val Val Gly Gly Tyr Ser
200 205 210

AGC CCT AAT TTT TTA TGG AAA TCT TTG ATT GAA GTG GAT TAC ACT TTT 834
Ser Pro Asn Phe Leu Trp Lys Ser Leu Ile Glu Val Asp Tyr Thr Phe
215 220 225

AAT GTG GGC GTG AGT TTA ACG CTT TAT AGG AAA CAC CGT TTA GAG ATT 882
Asn Val Gly Val Ser Leu Thr Leu Tyr Arg Lys His Arg Leu Glu Ile
230 235 240 245

GGC ACA AAA TTG CCG ATT AGC TAT TTG AGA ATG GGA GTG GAA GAG GGA 930
Gly Thr Lys Leu Pro Ile Ser Tyr Leu Arg Met Gly Val Glu Glu Gly
250 255 260

GCG ATT TAT CAA AAT AAA GAA GAT GAT GAG CGT TTG TTG GTT TCG GCT 978
Ala Ile Tyr Gln Asn Lys Glu Asp Asp Glu Arg Leu Leu Val Ser Ala
265 270 275

AAC AAC CAG TTC AAG CGA TCC AGT TTT TTA TTA GTG AAT TAT GCG TTT 1026
Asn Asn Gln Phe Lys Arg Ser Ser Phe Leu Leu Val Asn Tyr Ala Phe
280 285 290

ATT TTT TAAGGCTTGA TCTTGGAGTT AAGGTTTAAA ATTTTAGCGT TAGTCGTTTT AA 1084
Ile Phe
295

TTTTAGGGGG TTATTTGATT TTTAACGCTT TAATCACAAA ACCCAGAGCT TTAAGTTTTA 1144
GTTTAAATAG CAAAGAGGGT GCGCTTAATG ACAATGATG 1183



(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 314 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal
(ix) FEATURE:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 1...19
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

Met Leu Lys Phe Lys Tyr Gly Leu Ile Tyr Ile Ala Leu Ile Leu Gly
-15 -10 -5

Leu Gln Ala Thr Asp Tyr Asp Asn Leu Glu Glu Glu Asn Gln Gln Leu
1 5 10

Asp Glu Lys Ile Asn His Leu Lys Gln Gln Leu Thr Glu Lys Gly Val
15 20 25

Ser Pro Lys Glu Met Asp Lys Asp Lys Phe Glu Glu Glu Tyr Ile Asn
30 35 40 45

Arg Ser Tyr Pro Lys Ile Ser Ser Lys Lys Lys Glu Lys Leu Leu Lys
50 55 60

Ser Phe Ser Ile Ala Asp Asp Lys Ser Gly Val Phe Leu Gly Gly Gly
65 70 75

Tyr Ala Tyr Gly Glu Leu Asn Leu Ser Tyr Gln Gly Glu Met Leu Asp
80 85 90

Arg Tyr Gly Ala Asn Ala Pro Ser Ala Phe Lys Asn Asn Ile Asn Ile
95 100 105

Asn Ala Pro Val Ser Met Ile Ser Ala Lys Phe Gly Tyr Gln Lys Tyr
110 115 120 125

Phe Val Ser Tyr Phe Gly Thr Arg Phe Tyr Gly Asp Leu Leu Leu Gly
130 135 140

Gly Gly Ala Leu Lys Glu Asp Ala Ile Lys Gln Pro Val Gly Ser Phe
145 150 155

Ile Tyr Val Leu Gly Ala Val Asn Thr Asp Leu Leu Phe Asp Met Pro
160 165 170

Leu Asp Phe Lys Thr Lys Lys His Phe Leu Gly Val Tyr Ala Gly Phe
175 180 185

Gly Ile Gly Leu Met Leu Tyr Gln Asp Arg Pro Asn Gln Asn Gly Arg
190 195 200 205

Asn Leu Val Val Gly Gly Tyr Ser Ser Pro Asn Phe Leu Trp Lys Ser
210 215 220

Leu Ile Glu Val Asp Tyr Thr Phe Asn Val Gly Val Ser Leu Thr Leu
225 230 235

Tyr Arg Lys His Arg Leu Glu Ile Gly Thr Lys Leu Pro Ile Ser Tyr
240 245 250

Leu Arg Met Gly Val Glu Glu Gly Ala Ile Tyr Gln Asn Lys Glu Asp
255 260 265

Asp Glu Arg Leu Leu Val Ser Ala Asn Asn Gln Phe Lys Arg Ser Ser
270 275 280 285

Phe Leu Leu Val Asn Tyr Ala Phe Ile Phe
290 295



(2) INFORMATION FOR SEQ ID NO: 127:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1851 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 238...1665
(D) OTHER INFORMATION:

(A) NAME/KEY: Signal Sequence
(B) LOCATION: 238...313
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:

GAGCTAGTTT TAAAAAGTTA GTTTTGTTTT AAAAAGTTAA TACTATTTTG AAGCACTCCT 60

ATTCAGATGG CTAAGGCACA CAAGAAATTA GGGGACTCTG CTGTATTCCT ACCCTGAAGC 120

GTTACCCTAA AATCCTATTG CATAGGTCTA AATAAGAGCT TAGGGATCAT TTTAGCCATA 180

AAAAGCTTAT GTTTTCATTA AAAATGTTAT GATACGCTCA AATAGTCAAG CAAAAAA ATG 240
Met
-25

TCA ATT AAA AGG GTT AGA TTG AAA ATA TTC GTT CTG TTG ATG TCG GTA 288
Ser Ile Lys Arg Val Arg Leu Lys Ile Phe Val Leu Leu Met Ser Val
-20 -15 -10

ATT TTA GGA ATA TCA TTA ACA GGT TGC ATA GGC TAT CGT ATG GAC TTA 336
Ile Leu Gly Ile Ser Leu Thr Gly Cys Ile Gly Tyr Arg Met Asp Leu
-5 1 5

GAA CAT TTT AAC ACG CTC TAT TAT GAA GAA AGC CCT AAA AAA GCT TAT 384
Glu His Phe Asn Thr Leu Tyr Tyr Glu Glu Ser Pro Lys Lys Ala Tyr
10 15 20

GAA TAT TCC AAA CAA TTC ACT AAG AAA AAA AAG AAC GCT CTT TTA TGG 432
Glu Tyr Ser Lys Gln Phe Thr Lys Lys Lys Lys Asn Ala Leu Leu Trp
25 30 35 40

GAC TTG CAA AAC GGC TTG AGC GCT TTA TAC GCC AGA GAT TAC CAG ACT 480
Asp Leu Gln Asn Gly Leu Ser Ala Leu Tyr Ala Arg Asp Tyr Gln Thr
45 50 55

TCT TTA GGG GTA TTA GAT CAA GCC GAG CAA CGC TTT GAT AAA ACG CAA 528
Ser Leu Gly Val Leu Asp Gln Ala Glu Gln Arg Phe Asp Lys Thr Gln
60 65 70

AGC GCT TTT ACA AGA GGG GCT GGT TAT GTG GGC GCT ACC ATG ATT AAT 576
Ser Ala Phe Thr Arg Gly Ala Gly Tyr Val Gly Ala Thr Met Ile Asn
75 80 85

GAT AAT GTG CGC GCT TAT GGG GGG AAT ATT TAT GAG GGC GTT TTA ATC 624
Asp Asn Val Arg Ala Tyr Gly Gly Asn Ile Tyr Glu Gly Val Leu Ile
90 95 100

AAT TAT TAC AAA GCG ATA GAC TAC ATG CTT TTA AAC GAT AGC GCG AAA 672
Asn Tyr Tyr Lys Ala Ile Asp Tyr Met Leu Leu Asn Asp Ser Ala Lys
105 110 115 120

GCT AGG GTG CAA TTC AAC CGT GCG AAC GAA CGC CAG CGC AGG GCT AAA 720
Ala Arg Val Gln Phe Asn Arg Ala Asn Glu Arg Gln Arg Arg Ala Lys
125 130 135

GAA TTT TAT TAT GAG GAA GTG CAA AAA GCC ATT AAA GAG ATC GAT TCT 768
Glu Phe Tyr Tyr Glu Glu Val Gln Lys Ala Ile Lys Glu Ile Asp Ser
140 145 150

AGC AAA AAG CAC AAT ATT AAT ATG GAA CGC TCT AGG GTG GAA GTG AGC 816
Ser Lys Lys His Asn Ile Asn Met Glu Arg Ser Arg Val Glu Val Ser
155 160 165

GAG ATT TTA AAC AAC ACC TAT TCT AAT TTA GAC AAA TAC GAA GCT TAT 864
Glu Ile Leu Asn Asn Thr Tyr Ser Asn Leu Asp Lys Tyr Glu Ala Tyr
170 175 180

CAG GGC TTA CTT AAC CCG GCG GTT TCG TAT CTC TCA GGG TTG TTT TAC 912
Gln Gly Leu Leu Asn Pro Ala Val Ser Tyr Leu Ser Gly Leu Phe Tyr
185 190 195 200

GCT TTA AAT GGG GAT GAG AAT AAG GGA TTA GGC TAT CTT AAT GAA GCC 960
Ala Leu Asn Gly Asp Glu Asn Lys Gly Leu Gly Tyr Leu Asn Glu Ala
205 210 215

TAT GGG ATC AGT CAA AGC CCT TTT GTA GCC CAA GAC TTG GTT TTT TTC 1008
Tyr Gly Ile Ser Gln Ser Pro Phe Val Ala Gln Asp Leu Val Phe Phe
220 225 230

AAA AAC CCT AAC AGG AGC CAT TTC ACT TGG ATC ATC ATT GAA GAT GGT 1056
Lys Asn Pro Asn Arg Ser His Phe Thr Trp Ile Ile Ile Glu Asp Gly
235 240 245

AAA GAG CCG CAA AAA AGC GAA TTT AAA ATT GAT GTG CCT ATT TTT ATG 1104
Lys Glu Pro Gln Lys Ser Glu Phe Lys Ile Asp Val Pro Ile Phe Met
250 255 260

ATC GAT TCG GTT TAT AAC GTG AGT ATA GCC TTG CCC AAG CTA GAA AAA 1152
Ile Asp Ser Val Tyr Asn Val Ser Ile Ala Leu Pro Lys Leu Glu Lys
265 270 275 280

GGG GAA GCG TTT TAT CAA AAT TTC ACT CTC AAA GAT GGA GAA AAA GTA 1200
Gly Glu Ala Phe Tyr Gln Asn Phe Thr Leu Lys Asp Gly Glu Lys Val
285 290 295

ACG CCC TTT GAC ACT TTA GCC TCA ATA GAT GCG GTG GTC GCT AGC GAA 1248
Thr Pro Phe Asp Thr Leu Ala Ser Ile Asp Ala Val Val Ala Ser Glu
300 305 310

TTC AGG AAG CAG TTG CCC TAC ATT ATC ACT AGG GCT ATT TTA TCG GCC 1296
Phe Arg Lys Gln Leu Pro Tyr Ile Ile Thr Arg Ala Ile Leu Ser Ala
315 320 325

ACT TTT AAA GTG GGC ATG CAA GCG GTG GCG AAC TAT TAT TTG GGG TTT 1344
Thr Phe Lys Val Gly Met Gln Ala Val Ala Asn Tyr Tyr Leu Gly Phe
330 335 340

GTT GGA GGG TTA GTA ACT TCC TTG TAT TCA GGT GTG AGC ACC TTT GCA 1392
Val Gly Gly Leu Val Thr Ser Leu Tyr Ser Gly Val Ser Thr Phe Ala
345 350 355 360

GAC ACT AGA AGC ACG AGC ATT TTT GCC CAT AAA ATC TAC CTC ATG CGC 1440
Asp Thr Arg Ser Thr Ser Ile Phe Ala His Lys Ile Tyr Leu Met Arg
365 370 375

ATT AAA AAC AAA GCC TTT GAA AGT TAT GAA GTT CGA GCC GAT TCC ATT 1488
Ile Lys Asn Lys Ala Phe Glu Ser Tyr Glu Val Arg Ala Asp Ser Ile
380 385 390

GAC GCT TTT TCG TTT TCA TTA AAG CCT TGT AAA AGA TCG CTT GAA AGC 1536
Asp Ala Phe Ser Phe Ser Leu Lys Pro Cys Lys Arg Ser Leu Glu Ser
395 400 405

CCT AAA ATC ATT GAC GCT AGG GAA TTG CTT TCT GGG TTT GTA GCA GCC 1584
Pro Lys Ile Ile Asp Ala Arg Glu Leu Leu Ser Gly Phe Val Ala Ala
410 415 420

CCA CAA ATC TTT TGC TCT AAC CGC CAT AAT ATT TTA TAC GTG CGC AGT 1632
Pro Gln Ile Phe Cys Ser Asn Arg His Asn Ile Leu Tyr Val Arg Ser
425 430 435 440

TTT AAA AAC GGG TTT GTT TTG AGT CGT TTA AAA TGATTTCAAA ACCCCCACCA 1685
Phe Lys Asn Gly Phe Val Leu Ser Arg Leu Lys
445 450

AAGGAATTTT AGTTTTTAAG TGTCGTTGGC ATTAAACGCA AACACGATAT AATTATAAAA 1745
CGATACGAAA ACCTAAATTA AGGGGAAGTC ATGGCTGATA GTTTAGCGGG CATTGATCAA 1805
GTTACGAGTT TGCATAAAAA TAACGAGTTA CAATTGTTGT GTTTCA 1851

(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 476 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal
(ix) FEATURE:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 1...25
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:
Met Ser Ile Lys Arg Val Arg Leu Lys Ile Phe Val Leu Leu Met Ser
-25 -20 -15 -10

Val Ile Leu Gly Ile Ser Leu Thr Gly Cys Ile Gly Tyr Arg Met Asp
-5 1 5

Leu Glu His Phe Asn Thr Leu Tyr Tyr Glu Glu Ser Pro Lys Lys Ala
10 15 20

Tyr Glu Tyr Ser Lys Gln Phe Thr Lys Lys Lys Lys Asn Ala Leu Leu
25 30 35

Trp Asp Leu Gln Asn Gly Leu Ser Ala Leu Tyr Ala Arg Asp Tyr Gln
40 45 50 55

Thr Ser Leu Gly Val Leu Asp Gln Ala Glu Gln Arg Phe Asp Lys Thr
60 65 70

Gln Ser Ala Phe Thr Arg Gly Ala Gly Tyr Val Gly Ala Thr Met Ile
75 80 85

Asn Asp Asn Val Arg Ala Tyr Gly Gly Asn Ile Tyr Glu Gly Val Leu
90 95 100

Ile Asn Tyr Tyr Lys Ala Ile Asp Tyr Met Leu Leu Asn Asp Ser Ala
105 110 115

Lys Ala Arg Val Gln Phe Asn Arg Ala Asn Glu Arg Gln Arg Arg Ala
120 125 130 135

Lys Glu Phe Tyr Tyr Glu Glu Val Gln Lys Ala Ile Lys Glu Ile Asp
140 145 150

Ser Ser Lys Lys His Asn Ile Asn Met Glu Arg Ser Arg Val Glu Val
155 160 165

Ser Glu Ile Leu Asn Asn Thr Tyr Ser Asn Leu Asp Lys Tyr Glu Ala
170 175 180

Tyr Gln Gly Leu Leu Asn Pro Ala Val Ser Tyr Leu Ser Gly Leu Phe
185 190 195

Tyr Ala Leu Asn Gly Asp Glu Asn Lys Gly Leu Gly Tyr Leu Asn Glu
200 205 210 215

Ala Tyr Gly Ile Ser Gln Ser Pro Phe Val Ala Gln Asp Leu Val Phe
220 225 230

Phe Lys Asn Pro Asn Arg Ser His Phe Thr Trp Ile Ile Ile Glu Asp
235 240 245

Gly Lys Glu Pro Gln Lys Ser Glu Phe Lys Ile Asp Val Pro Ile Phe
250 255 260

Met Ile Asp Ser Val Tyr Asn Val Ser Ile Ala Leu Pro Lys Leu Glu
265 270 275

Lys Gly Glu Ala Phe Tyr Gln Asn Phe Thr Leu Lys Asp Gly Glu Lys
280 285 290 295

Val Thr Pro Phe Asp Thr Leu Ala Ser Ile Asp Ala Val Val Ala Ser
300 305 310

Glu Phe Arg Lys Gln Leu Pro Tyr Ile Ile Thr Arg Ala Ile Leu Ser
315 320 325

Ala Thr Phe Lys Val Gly Met Gln Ala Val Ala Asn Tyr Tyr Leu Gly
330 335 340

Phe Val Gly Gly Leu Val Thr Ser Leu Tyr Ser Gly Val Ser Thr Phe
345 350 355

Ala Asp Thr Arg Ser Thr Ser Ile Phe Ala His Lys Ile Tyr Leu Met
360 365 370 375

Arg Ile Lys Asn Lys Ala Phe Glu Ser Tyr Glu Val Arg Ala Asp Ser
380 385 390

Ile Asp Ala Phe Ser Phe Ser Leu Lys Pro Cys Lys Arg Ser Leu Glu
395 400 405

Ser Pro Lys Ile Ile Asp Ala Arg Glu Leu Leu Ser Gly Phe Val Ala
410 415 420

Ala Pro Gln Ile Phe Cys Ser Asn Arg His Asn Ile Leu Tyr Val Arg
425 430 435

Ser Phe Lys Asn Gly Phe Val Leu Ser Arg Leu Lys
440 445 450



(2) INFORMATION FOR SEQ ID NO: 129:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 435 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 1...432
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

ATG TTA GAA AAA TTG ATT GAA AGA GTG TTG TTT GCC ACT CGT TGG TTG 48
Met Leu Glu Lys Leu Ile Glu Arg Val Leu Phe Ala Thr Arg Trp Leu
1 5 10 15

CTA GCC CCT TTA TGT ATT GCC ATG TCG TTA GTG CTG GTG GTT TTA GGC 96
Leu Ala Pro Leu Cys Ile Ala Met Ser Leu Val Leu Val Val Leu Gly
20 25 30

TAT GTG TTC ATG AAA GAG TTG TGG CAC ATG CTC AGC CAT TTA AAC ACG 144
Tyr Val Phe Met Lys Glu Leu Trp His Met Leu Ser His Leu Asn Thr
35 40 45

ATC AGC GAA ACG GAT TTG GTT TTA TCA GCC TTA GGA TTA GTG GAT TTG 192
Ile Ser Glu Thr Asp Leu Val Leu Ser Ala Leu Gly Leu Val Asp Leu
50 55 60

TTG TTT ATG GCC GGG CTT GTT TTA ATG GTG TTA CTC GCC AGT TAT GAA 240
Leu Phe Met Ala Gly Leu Val Leu Met Val Leu Leu Ala Ser Tyr Glu
65 70 75 80

AGC TTT GTT TCT AAA TTA GAC AAG GTG GAT GCC AGT GAA ATC ACT TGG 288
Ser Phe Val Ser Lys Leu Asp Lys Val Asp Ala Ser Glu Ile Thr Trp
85 90 95

CTA AAG CAC ACG GAT TTT AAC GCT TTA AAA TTA AAG GTT TCA CTC TCC 336
Leu Lys His Thr Asp Phe Asn Ala Leu Lys Leu Lys Val Ser Leu Ser
100 105 110

ATT GTA GCG ATT TCA GCG ATT TTC TTG CTC AAA CGC TAC ATG AGT TTA 384
Ile Val Ala Ile Ser Ala Ile Phe Leu Leu Lys Arg Tyr Met Ser Leu
115 120 125

GAA AGA TGT TTT ATC CCA GCA TTC CCT AAG GAT ACG CCC CCT ATC GCA T 433
Glu Arg Cys Phe Ile Pro Ala Phe Pro Lys Asp Thr Pro Pro Ile Ala
130 135 140

AA 435

(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 144 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:

Met Leu Glu Lys Leu Ile Glu Arg Val Leu Phe Ala Thr Arg Trp Leu
1 5 10 15

Leu Ala Pro Leu Cys Ile Ala Met Ser Leu Val Leu Val Val Leu Gly
20 25 30

Tyr Val Phe Met Lys Glu Leu Trp His Met Leu Ser His Leu Asn Thr
35 40 45

Ile Ser Glu Thr Asp Leu Val Leu Ser Ala Leu Gly Leu Val Asp Leu
50 55 60

Leu Phe Met Ala Gly Leu Val Leu Met Val Leu Leu Ala Ser Tyr Glu
65 70 75 80

Ser Phe Val Ser Lys Leu Asp Lys Val Asp Ala Ser Glu Ile Thr Trp
85 90 95

Leu Lys His Thr Asp Phe Asn Ala Leu Lys Leu Lys Val Ser Leu Ser
100 105 110

Ile Val Ala Ile Ser Ala Ile Phe Leu Leu Lys Arg Tyr Met Ser Leu
115 120 125

Glu Arg Cys Phe Ile Pro Ala Phe Pro Lys Asp Thr Pro Pro Ile Ala
130 135 140
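
As a reading aid only (it is not part of the listing), the relationship between SEQ ID NO: 129 and SEQ ID NO: 130 can be checked with a minimal sketch such as the one below. It assumes the standard genetic code; the names `CODON_TABLE`, `THREE_LETTER` and `translate` are hypothetical helpers introduced here, and only the first row of SEQ ID NO: 129 is used as input.

```python
# Standard genetic code, laid out in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter residue codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna):
    """Translate a coding sequence into three-letter residues.

    Whitespace between codons is ignored; translation stops at the
    first stop codon or when fewer than three bases remain.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(THREE_LETTER[aa])
    return residues


# First row of SEQ ID NO: 129 (nucleotides 1 to 48).
first_row = "ATG TTA GAA AAA TTG ATT GAA AGA GTG TTG TTT GCC ACT CGT TGG TTG"
print(" ".join(translate(first_row)))

# The coding sequence spans nucleotides 1...432, i.e. 432 / 3 codons.
print((432 - 1 + 1) // 3)
```

The printed residues can be compared against the first row of SEQ ID NO: 130, and the codon count against its stated length of 144 amino acids.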



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 213...2081
(D) OTHER INFORMATION:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 213...273
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

ATCATAAAAT GTAAAAATAC TCAAAGCATC GCATCAAGCA ATATAGCGAT CTGAAAAGAG 60

GCTCACAATT GAGCTAAAGC CCGCTTTTTA GGGATAAATA AAAAGCGTTT TCAAATTGCA 120

TGGGTAACTT TATGGGGCGA AGCGTTTCTA AATTTTGGTA TAATCGCTAG AAATTGTGAG 180

AAAGATTCTA TCTTGTTTGA GTGGGGTTTC GC ATG CGT TTA TTA TTG TGG TGG 233
Met Arg Leu Leu Leu Trp Trp
-20 -15

GTA TTG GTA TTA TCG CTC TTT TTA AAT CCT TTG AGA GCG GTT GAA GAG 281
Val Leu Val Leu Ser Leu Phe Leu Asn Pro Leu Arg Ala Val Glu Glu
-10 -5 1

CAT GAA ACA GAT GCG GTG GAT TTG TTT TTG ATT TTC AAT CAA ATC AAC 329
His Glu Thr Asp Ala Val Asp Leu Phe Leu Ile Phe Asn Gln Ile Asn
5 10 15

CAG CTC AAT CAA GTC ATT GAA ACT TAC AAA AAA AAC CCT GAA AGA AGC 377
Gln Leu Asn Gln Val Ile Glu Thr Tyr Lys Lys Asn Pro Glu Arg Ser
20 25 30 35

GCT GAA ATC TCT CTG TAT AAC ACC CAA AAG AAT GAC TTG ATT AAA AGT 425
Ala Glu Ile Ser Leu Tyr Asn Thr Gln Lys Asn Asp Leu Ile Lys Ser
40 45 50

TTG ACT TCT AAA GTG TTG AAT GAA AGG GAT AAG ATC GGG ATT GAT ATC 473
Leu Thr Ser Lys Val Leu Asn Glu Arg Asp Lys Ile Gly Ile Asp Ile
55 60 65

AAT CAA AAT TTA AAA GAG CAG GAA AAA ATC AAA AAG CGT TTG TCT AAA 521
Asn Gln Asn Leu Lys Glu Gln Glu Lys Ile Lys Lys Arg Leu Ser Lys
70 75 80

AGC ATT AAT GGC GAT GAT TTC TAC ACT TTC ATG AAA GAC AGA TTG TCT 569
Ser Ile Asn Gly Asp Asp Phe Tyr Thr Phe Met Lys Asp Arg Leu Ser
85 90 95

TTA GAT ATT TTG TTG ATA GAT GAA ATT TTG TAT CGT TTT ATA GAT AAA 617
Leu Asp Ile Leu Leu Ile Asp Glu Ile Leu Tyr Arg Phe Ile Asp Lys
100 105 110 115

ATC AGG AGC AGT ATT GAT ATT TTT AGC GAA CAA AAA GAT GTA GAA AGC 665
Ile Arg Ser Ser Ile Asp Ile Phe Ser Glu Gln Lys Asp Val Glu Ser
120 125 130

ATC AGC GAT GCT TTC CTT TTG CGT TTA GGG CAA TTC AAA CTC TAC ACT 713




Ile Ser Asp Ala Phe Leu Leu Arg Leu Gly Gln Phe Lys Leu Tyr Thr
135 140 145

TTC CCT AAA AAT TTA GGC AAT GTC AAA ATG CAT GAA TTA GAG CAG ATG 761
Phe Pro Lys Asn Leu Gly Asn Val Lys Met His Glu Leu Glu Gln Met
150 155 160

TTT AGC GAT TAT GAA TTG CGT TTG AAC ACT TAC ACC GAA GTC TTG CGT 809
Phe Ser Asp Tyr Glu Leu Arg Leu Asn Thr Tyr Thr Glu Val Leu Arg
165 170 175

TAC ATT AAA AAC CAC CCT AAA GAA GTG CTT CCT AAA AAC TTG ATC ATG 857
Tyr Ile Lys Asn His Pro Lys Glu Val Leu Pro Lys Asn Leu Ile Met
180 185 190 195

GAA GTG AAT ATG GAT TTT GTG TTA AAC AAA ATC AGC AAG GTT TTG CCT 905
Glu Val Asn Met Asp Phe Val Leu Asn Lys Ile Ser Lys Val Leu Pro
200 205 210

TTC ACA ACC CAT AGC TTG CAA GTG AGT AAA ATC GTG CTA GCT TTG ACG 953
Phe Thr Thr His Ser Leu Gln Val Ser Lys Ile Val Leu Ala Leu Thr
215 220 225

ATT TTA GCC TTA TTG CTG GGT TTA AGG AAG TTG ATC ACT TGG CTT TTA 1001
Ile Leu Ala Leu Leu Leu Gly Leu Arg Lys Leu Ile Thr Trp Leu Leu
230 235 240

GCC TTA TTG TTA GAT CGT ATT TTT GAA ATC ATG CAG CGC AAT AAA AAA 1049
Ala Leu Leu Leu Asp Arg Ile Phe Glu Ile Met Gln Arg Asn Lys Lys
245 250 255

ATG CAT GTC AAT GTG CAA AAG AGC ATT GTT TCG CCG GTT TCT GTC TTT 1097
Met His Val Asn Val Gln Lys Ser Ile Val Ser Pro Val Ser Val Phe
260 265 270 275

TTA GCC CTA TTT AGT TGC GAT GTG GCT TTA GAT ATT TTC TAC TAC CCT 1145
Leu Ala Leu Phe Ser Cys Asp Val Ala Leu Asp Ile Phe Tyr Tyr Pro
280 285 290

AAC GCA TCG CCC CCT AAA GTT TCT ATG TGG GTG GGC GCG GTG TAT ATC 1193
Asn Ala Ser Pro Pro Lys Val Ser Met Trp Val Gly Ala Val Tyr Ile
295 300 305

ATG CTT TTA GCA TGG TTA GTG ATA GCG CTT TTT AAA GGC TAT GGG GAA 1241
Met Leu Leu Ala Trp Leu Val Ile Ala Leu Phe Lys Gly Tyr Gly Glu
310 315 320

GCG TTA GTT ACG AAT ATG GCT ACC AAA AGC ACG CAC AAT TTT AGA AAA 1289
Ala Leu Val Thr Asn Met Ala Thr Lys Ser Thr His Asn Phe Arg Lys
325 330 335

GAA GTG ATC AAC TTG ATT TTA AAA GTC GTG TAT TTT TTG ATC TTT ATT 1337
Glu Val Ile Asn Leu Ile Leu Lys Val Val Tyr Phe Leu Ile Phe Ile
340 345 350 355
[nucleotides 1338-1385 not legible in source] 1385
Val Ala Leu Leu Gly Val Leu Lys Gln Leu Gly Phe Asn Val Ser Ala
360 365 370

ATC ATC GCT TCT TTA GGG ATT GGG GGG TTA GCG GTG GCT TTG GCG GTT 1433
Ile Ile Ala Ser Leu Gly Ile Gly Gly Leu Ala Val Ala Leu Ala Val
375 380 385

AAA GAT GTG TTA GCG AAT TTT TTT GCT TCG GTC ATT TTA TTA TTA GAC 1481
Lys Asp Val Leu Ala Asn Phe Phe Ala Ser Val Ile Leu Leu Leu Asp
390 395 400

AAT TCG TTT TCT CAA GGG GAT TGG ATC GTG TGC GGT GAA GTG GAG GGC 1529
Asn Ser Phe Ser Gln Gly Asp Trp Ile Val Cys Gly Glu Val Glu Gly
405 410 415

ACG GTG GTG GAA ATG GGG TTA AGG CGC ACC ACG ATC AGA GCC TTT GAC 1577
Thr Val Val Glu Met Gly Leu Arg Arg Thr Thr Ile Arg Ala Phe Asp
420 425 430 435

AAC GCT CTT TTG TCC GTG CCT AAT TCA GAA TTA GCC GGA AAA CCC ATC 1625
Asn Ala Leu Leu Ser Val Pro Asn Ser Glu Leu Ala Gly Lys Pro Ile
440 445 450

AGG AAT TGG AGC CGT CGT AAA GTG GGA AGG CGT ATT AAA ATG GAA ATA 1673
Arg Asn Trp Ser Arg Arg Lys Val Gly Arg Arg Ile Lys Met Glu Ile
455 460 465

GGC TTA ACT TAT AGC TCC AGT CAA AGC GCT TTA CAG CTT TGC GTG AAA 1721
Gly Leu Thr Tyr Ser Ser Ser Gln Ser Ala Leu Gln Leu Cys Val Lys
470 475 480

GAC ATT AAA GAA ATG TTA GAA AAC CAC CCT AAA ATC GCT AAC GGA GCC 1769
Asp Ile Lys Glu Met Leu Glu Asn His Pro Lys Ile Ala Asn Gly Ala
485 490 495

GAT AGC GCT TTG CAA AAT GTG AGC GAT TAC CGC TAC ATG TTT AAA AAA 1817
Asp Ser Ala Leu Gln Asn Val Ser Asp Tyr Arg Tyr Met Phe Lys Lys
500 505 510 515

GAT ATT GTT TCT ATT GAT GAT TTT TTA GGG TAT AAA AAC AAT TTG TTT 1865
Asp Ile Val Ser Ile Asp Asp Phe Leu Gly Tyr Lys Asn Asn Leu Phe
520 525 530

GTC TTT TTA GAT CAG TTT GCG GAC AGC TCT ATT AAT ATT TTA GTG TAT 1913
Val Phe Leu Asp Gln Phe Ala Asp Ser Ser Ile Asn Ile Leu Val Tyr
535 540 545

TGC TTT TCT AAG ACA GTG GTT TGG GAA GAG TGG CTA GAA GTC AAA GAA 1961
Cys Phe Ser Lys Thr Val Val Trp Glu Glu Trp Leu Glu Val Lys Glu
550 555 560

GAT GTG ATG CTA AAA ATC ATG GGG ATT GTA GAA AAG CAC CAT TTG AGT 2009
Asp Val Met Leu Lys Ile Met Gly Ile Val Glu Lys His His Leu Ser
565 570 575

TTT GCT TTC CCA TCA CAG AGT TTG TAT GTG GAG AGT TTG CCA GAA GTT 2057
Phe Ala Phe Pro Ser Gln Ser Leu Tyr Val Glu Ser Leu Pro Glu Val
580 585 590 595

AGC CTG AAA GAA GGG GCT AAA ATC TGAAATTATT GGTAGATGTA TTCTTTGGTT 2111
Ser Leu Lys Glu Gly Ala Lys Ile
600



AAGGGGAAAG TGTTATCCAC GCTGTTGGTT AAAAGCAATT GGAATAAATC CGCGCTCCCC 2171
ACCCTAAAGG CGGATGCGCA AGTCCTTAAA TACAGATCCC ACATGCGGAT AAAGCGTTCG 2231
TCA 2234

(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 623 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal
(ix) FEATURE:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 1...20
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:

Met Arg Leu Leu Leu Trp Trp Val Leu Val Leu Ser Leu Phe Leu Asn
-20 -15 -10 -5

Pro Leu Arg Ala Val Glu Glu His Glu Thr Asp Ala Val Asp Leu Phe
1 5 10

Leu Ile Phe Asn Gln Ile Asn Gln Leu Asn Gln Val Ile Glu Thr Tyr
15 20 25

Lys Lys Asn Pro Glu Arg Ser Ala Glu Ile Ser Leu Tyr Asn Thr Gln
30 35 40

Lys Asn Asp Leu Ile Lys Ser Leu Thr Ser Lys Val Leu Asn Glu Arg
45 50 55 60

Asp Lys Ile Gly Ile Asp Ile Asn Gln Asn Leu Lys Glu Gln Glu Lys
65 70 75

Ile Lys Lys Arg Leu Ser Lys Ser Ile Asn Gly Asp Asp Phe Tyr Thr
80 85 90

Phe Met Lys Asp Arg Leu Ser Leu Asp Ile Leu Leu Ile Asp Glu Ile
95 100 105

Leu Tyr Arg Phe Ile Asp Lys Ile Arg Ser Ser Ile Asp Ile Phe Ser
110 115 120

Glu Gln Lys Asp Val Glu Ser Ile Ser Asp Ala Phe Leu Leu Arg Leu
125 130 135 140

Gly Gln Phe Lys Leu Tyr Thr Phe Pro Lys Asn Leu Gly Asn Val Lys
145 150 155

Met His Glu Leu Glu Gln Met Phe Ser Asp Tyr Glu Leu Arg Leu Asn
160 165 170

Thr Tyr Thr Glu Val Leu Arg Tyr Ile Lys Asn His Pro Lys Glu Val
175 180 185

Leu Pro Lys Asn Leu Ile Met Glu Val Asn Met Asp Phe Val Leu Asn
190 195 200

Lys Ile Ser Lys Val Leu Pro Phe Thr Thr His Ser Leu Gln Val Ser
205 210 215 220

Lys Ile Val Leu Ala Leu Thr Ile Leu Ala Leu Leu Leu Gly Leu Arg
225 230 235

Lys Leu Ile Thr Trp Leu Leu Ala Leu Leu Leu Asp Arg Ile Phe Glu
240 245 250

Ile Met Gln Arg Asn Lys Lys Met His Val Asn Val Gln Lys Ser Ile
255 260 265

Val Ser Pro Val Ser Val Phe Leu Ala Leu Phe Ser Cys Asp Val Ala
270 275 280

Leu Asp Ile Phe Tyr Tyr Pro Asn Ala Ser Pro Pro Lys Val Ser Met
285 290 295 300

Trp Val Gly Ala Val Tyr Ile Met Leu Leu Ala Trp Leu Val Ile Ala
305 310 315

Leu Phe Lys Gly Tyr Gly Glu Ala Leu Val Thr Asn Met Ala Thr Lys
320 325 330

Ser Thr His Asn Phe Arg Lys Glu Val Ile Asn Leu Ile Leu Lys Val
335 340 345

Val Tyr Phe Leu Ile Phe Ile Val Ala Leu Leu Gly Val Leu Lys Gln
350 355 360

Leu Gly Phe Asn Val Ser Ala Ile Ile Ala Ser Leu Gly Ile Gly Gly
365 370 375 380

Leu Ala Val Ala Leu Ala Val Lys Asp Val Leu Ala Asn Phe Phe Ala
385 390 395

Ser Val Ile Leu Leu Leu Asp Asn Ser Phe Ser Gln Gly Asp Trp Ile
400 405 410

Val Cys Gly Glu Val Glu Gly Thr Val Val Glu Met Gly Leu Arg Arg
415 420 425

Thr Thr Ile Arg Ala Phe Asp Asn Ala Leu Leu Ser Val Pro Asn Ser
430 435 440

Glu Leu Ala Gly Lys Pro Ile Arg Asn Trp Ser Arg Arg Lys Val Gly
445 450 455 460

Arg Arg Ile Lys Met Glu Ile Gly Leu Thr Tyr Ser Ser Ser Gln Ser
465 470 475

Ala Leu Gln Leu Cys Val Lys Asp Ile Lys Glu Met Leu Glu Asn His
480 485 490

Pro Lys Ile Ala Asn Gly Ala Asp Ser Ala Leu Gln Asn Val Ser Asp
495 500 505

Tyr Arg Tyr Met Phe Lys Lys Asp Ile Val Ser Ile Asp Asp Phe Leu
510 515 520

Gly Tyr Lys Asn Asn Leu Phe Val Phe Leu Asp Gln Phe Ala Asp Ser
525 530 535 540

Ser Ile Asn Ile Leu Val Tyr Cys Phe Ser Lys Thr Val Val Trp Glu
545 550 555

Glu Trp Leu Glu Val Lys Glu Asp Val Met Leu Lys Ile Met Gly Ile
560 565 570

Val Glu Lys His His Leu Ser Phe Ala Phe Pro Ser Gln Ser Leu Tyr
575 580 585

Val Glu Ser Leu Pro Glu Val Ser Leu Lys Glu Gly Ala Lys Ile
590 595 600

(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 432 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 1...429
(D) OTHER INFORMATION:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 1...93
(D) OTHER INFORMATION:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 94...429
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

ATG AAA AAA TTT TTT TCT CAA TCT TTA TTA GCT TTG ATT GTG TCT ATG 48
Met Lys Lys Phe Phe Ser Gln Ser Leu Leu Ala Leu Ile Val Ser Met
-31 -30 -25 -20

AAC GCG CTA CTG GCC ATG GAT GGC AAT GGC GTT TTT TTA GGG GCG GGT 96
Asn Ala Leu Leu Ala Met Asp Gly Asn Gly Val Phe Leu Gly Ala Gly
-15 -10 -5 1

TAT TTG CAA GGG CAA GCC CAA ATG CAT GCG GAT ATT AAT TCT CAA AAA 144
Tyr Leu Gln Gly Gln Ala Gln Met His Ala Asp Ile Asn Ser Gln Lys
5 10 15

CAA GCC ACT AAC GCT ACT ATC AAA GGC TTT GAT GCG CTT TTA GGG TAT 192
Gln Ala Thr Asn Ala Thr Ile Lys Gly Phe Asp Ala Leu Leu Gly Tyr
20 25 30

CAA TTT TTC TTT GGG AAA TAC TTT GGC TTG CGT GCT TAT GGG TTT TTT 240
Gln Phe Phe Phe Gly Lys Tyr Phe Gly Leu Arg Ala Tyr Gly Phe Phe
35 40 45

GAC TAC GCT CAT GCC AAT TCT ATT AGG CTT AAA AAC CCT AAC TAT AAC 288
Asp Tyr Ala His Ala Asn Ser Ile Arg Leu Lys Asn Pro Asn Tyr Asn
50 55 60 65

AGC GAA GTG GCG CAA TTG GCG GGT CAA ATT CTT GGG AAA CAA GAA ATC 336
Ser Glu Val Ala Gln Leu Ala Gly Gln Ile Leu Gly Lys Gln Glu Ile
70 75 80

AAT CGC TTA ACG AGC CTT GCT GAT CCT AAA ACC TTT GAG CCA AAC ATG 384
Asn Arg Leu Thr Ser Leu Ala Asp Pro Lys Thr Phe Glu Pro Asn Met
85 90 95

CTC ACT TAT GGG GGG GCT ATG GAT TTA ATG GTT AAT GTT CAT CAA TAA 432
Leu Thr Tyr Gly Gly Ala Met Asp Leu Met Val Asn Val His Gln
100 105 110

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 143 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:

Met Lys Lys Phe Phe Ser Gln Ser Leu Leu Ala Leu Ile Val Ser Met
-31 -30 -25 -20

Asn Ala Leu Leu Ala Met Asp Gly Asn Gly Val Phe Leu Gly Ala Gly
-15 -10 -5 1

Tyr Leu Gln Gly Gln Ala Gln Met His Ala Asp Ile Asn Ser Gln Lys
5 10 15

Gln Ala Thr Asn Ala Thr Ile Lys Gly Phe Asp Ala Leu Leu Gly Tyr
20 25 30

Gln Phe Phe Phe Gly Lys Tyr Phe Gly Leu Arg Ala Tyr Gly Phe Phe
35 40 45

Asp Tyr Ala His Ala Asn Ser Ile Arg Leu Lys Asn Pro Asn Tyr Asn
50 55 60 65

Ser Glu Val Ala Gln Leu Ala Gly Gln Ile Leu Gly Lys Gln Glu Ile
70 75 80

Asn Arg Leu Thr Ser Leu Ala Asp Pro Lys Thr Phe Glu Pro Asn Met
85 90 95

Leu Thr Tyr Gly Gly Ala Met Asp Leu Met Val Asn Val His Gln
100 105 110
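
The feature locations given for SEQ ID NO: 133 (coding sequence 1...429, sig_peptide 1...93, mat_peptide 94...429) determine both the length of SEQ ID NO: 134 and its residue numbering, in which signal-peptide residues carry negative numbers and the first mature residue is 1. The sketch below works through that arithmetic; `codon_count` and `mature_number` are hypothetical helper names introduced for illustration, and the convention of skipping residue 0 is inferred from the numbering printed under the sequences.

```python
def codon_count(start, end):
    """Number of codons in an inclusive, 1-based nucleotide range."""
    length = end - start + 1
    if length % 3:
        raise ValueError(f"range {start}...{end} is not a whole number of codons")
    return length // 3


def mature_number(index, signal_len):
    """Map a 1-based precursor residue index to the listing's numbering.

    Signal-peptide residues are numbered -signal_len ... -1 and the
    mature protein starts at 1; there is no residue 0.
    """
    offset = index - signal_len
    return offset if offset > 0 else offset - 1


signal = codon_count(1, 93)     # sig_peptide 1...93
mature = codon_count(94, 429)   # mat_peptide 94...429
print(signal, mature, signal + mature)

# First precursor residue, last signal residue, first mature residue.
print(mature_number(1, signal),
      mature_number(signal, signal),
      mature_number(signal + 1, signal))
```

Under these assumptions the signal peptide has 31 residues and the mature peptide 112, which together give the 143 amino acids stated for SEQ ID NO: 134, and the first residue is numbered -31 as printed.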

(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 336 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 1...333
(D) OTHER INFORMATION:

(A) NAME/KEY: sig_peptide

(B) LOCATION: 1...60
(D) OTHER INFORMATION:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 61...333
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

ATG AAA ACC TTT AAA AAC CTG CTC TGT TTT AGC CTG ATC GCT ATG AGT 48
Met Lys Thr Phe Lys Asn Leu Leu Cys Phe Ser Leu Ile Ala Met Ser
-20 -15 -10 -5

TGG CTC CAA GCG GAC ATG TTG GAT AAT TTC ACT AGG GCC ATT AAC AGC 96
Trp Leu Gln Ala Asp Met Leu Asp Asn Phe Thr Arg Ala Ile Asn Ser
1 5 10

TAC ACC ACT AAA AAG CTT AAT GAA ATC AAG GAT CAA GTC AAT AGC GCT 144
Tyr Thr Thr Lys Lys Leu Asn Glu Ile Lys Asp Gln Val Asn Ser Ala
15 20 25

AAC CCT ACT AAA AAT CAC AAT ACC ACT TAT AAC GCT AAT GGC ATG CTC 192
Asn Pro Thr Lys Asn His Asn Thr Thr Tyr Asn Ala Asn Gly Met Leu
30 35 40

ATT AAC ATT GAT TGT AAA GTC TTA AAA AAT AAC TTC TAT TCG GTG TGT 240
Ile Asn Ile Asp Cys Lys Val Leu Lys Asn Asn Phe Tyr Ser Val Cys
45 50 55 60

TAT TCT AGC GAG TTA AAA AAC CCT ATT TAT GGC GTG AGC GTG TTG TTT 288
Tyr Ser Ser Glu Leu Lys Asn Pro Ile Tyr Gly Val Ser Val Leu Phe
65 70 75

GGG GAT TTA GTG GAT AAA AAT AAT ATT GAA AAA CGC TAT GAG TTT TAA 336
Gly Asp Leu Val Asp Lys Asn Asn Ile Glu Lys Arg Tyr Glu Phe
80 85 90



(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 111 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:

Met Lys Thr Phe Lys Asn Leu Leu Cys Phe Ser Leu Ile Ala Met Ser
-20 -15 -10 -5

Trp Leu Gln Ala Asp Met Leu Asp Asn Phe Thr Arg Ala Ile Asn Ser
1 5 10

Tyr Thr Thr Lys Lys Leu Asn Glu Ile Lys Asp Gln Val Asn Ser Ala
15 20 25

Asn Pro Thr Lys Asn His Asn Thr Thr Tyr Asn Ala Asn Gly Met Leu
30 35 40

Ile Asn Ile Asp Cys Lys Val Leu Lys Asn Asn Phe Tyr Ser Val Cys
45 50 55 60

Tyr Ser Ser Glu Leu Lys Asn Pro Ile Tyr Gly Val Ser Val Leu Phe
65 70 75

Gly Asp Leu Val Asp Lys Asn Asn Ile Glu Lys Arg Tyr Glu Phe
80 85 90



(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2185 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 81...2069
(D) OTHER INFORMATION:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 81...144
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

GTAAAAAATG GCTTATCTGT TCTAGCCTAC TCCCCTTATT TTTTCTTAAT CCCTTAGCGG 60

CAGAAGATGA TGGGTTTTTT ATG GGG GTG AGT TAT CAA ACT TCT CTA GCT 110
Met Gly Val Ser Tyr Gln Thr Ser Leu Ala
-20 -15

ATT CAA AGG GTG GAT AAC TCA GGG CTT AAC GCC AGT CAA GCC GCA TCC 158
Ile Gln Arg Val Asp Asn Ser Gly Leu Asn Ala Ser Gln Ala Ala Ser
-10 -5 1 5

ACC TAC ATC CGC CAG AAC GCT ATC GCT CTA GAA TCT GCG GCG GTG CCT 206
Thr Tyr Ile Arg Gln Asn Ala Ile Ala Leu Glu Ser Ala Ala Val Pro
10 15 20

TTA GCC TAT TAT TTA GAA GCG ATG GGC CAA CAA ACC AGG GTT TTA ATG 254
Leu Ala Tyr Tyr Leu Glu Ala Met Gly Gln Gln Thr Arg Val Leu Met
25 30 35

CAA ATG CTC TGC CCT GAT CCT TCC AAA CGC TGT TTG CTC TAT GCT GGA 302
Gln Met Leu Cys Pro Asp Pro Ser Lys Arg Cys Leu Leu Tyr Ala Gly
40 45 50

GGT TAT AAA AAC GGA TCA AGT AAT ACT AAC GGC GAT ACA GGC AAC AAC 350
Gly Tyr Lys Asn Gly Ser Ser Asn Thr Asn Gly Asp Thr Gly Asn Asn
55 60 65

CCC CCA AGA GGC AAT GTC AAT GCC ACC TTT GAT ATG CAA TCT CTA GTC 398
Pro Pro Arg Gly Asn Val Asn Ala Thr Phe Asp Met Gln Ser Leu Val
70 75 80 85

AAT AAT TTA AAC AAG CTC ACC CAA CTC ATC GGC GAG ACT TTA ATC CGT 446
Asn Asn Leu Asn Lys Leu Thr Gln Leu Ile Gly Glu Thr Leu Ile Arg
90 95 100

AAC CCT GAA AAT CTT TCT AAC GCC AAA GTC TTT AAT GTC AAA TTT GGC 494
Asn Pro Glu Asn Leu Ser Asn Ala Lys Val Phe Asn Val Lys Phe Gly
105 110 115

AAT CAA AGC ACT GTT ATT GCA TTG CCT GAG GGT CTA GCC AAT ACC ATG 542
Asn Gln Ser Thr Val Ile Ala Leu Pro Glu Gly Leu Ala Asn Thr Met
120 125 130

AAC GCT TTA AAC GAT GAT ATT ACC AAC GCT TTA ACC ACG CTC TGG TAT 590
Asn Ala Leu Asn Asp Asp Ile Thr Asn Ala Leu Thr Thr Leu Trp Tyr
135 140 145

AAC CAA ACC TTA ACG AAT AAA TCT TTT AAT AGC GGT AAT TCC GTG AAT 638
Asn Gln Thr Leu Thr Asn Lys Ser Phe Asn Ser Gly Asn Ser Val Asn
150 155 160 165

TTT AGC CCC CAA GTC TTG CAA CAC CTT TTA CAA GAC GGC TTA GCC ACA 686
Phe Ser Pro Gln Val Leu Gln His Leu Leu Gln Asp Gly Leu Ala Thr
170 175 180

AGT AAT CAA ACC ATT TGC AGC ACT CAA AAC CAA TGC ACC GCC ACC AAT 734
Ser Asn Gln Thr Ile Cys Ser Thr Gln Asn Gln Cys Thr Ala Thr Asn
185 190 195

GAA GCT AAA TCT ATC GCT CAA AAC GCC CAA AAC ATC TTC CAG GCT TTA 782
Glu Ala Lys Ser Ile Ala Gln Asn Ala Gln Asn Ile Phe Gln Ala Leu
200 205 210

ATG CAA GCA GGG ATT TTA GGG GGC TTA GCC AAT GAA AAG CAA TTT GGC 830
Met Gln Ala Gly Ile Leu Gly Gly Leu Ala Asn Glu Lys Gln Phe Gly
215 220 225

TTC ACT TAC AAC AAA GCC CCT AAT GGT AGC GAT TCC CAA CAA GGC TAC 878
Phe Thr Tyr Asn Lys Ala Pro Asn Gly Ser Asp Ser Gln Gln Gly Tyr
230 235 240 245
CAA AGC TTT AGC GGC CCG GGT TAT TAC ACT AAA AAC GGC GCT AAT GGC 926
Gln Ser Phe Ser Gly Pro Gly Tyr Tyr Thr Lys Asn Gly Ala Asn Gly
250 255 260

ACT ACC CAA GCG CCC TTG AAA GCA TTA CCC GCT GGA GCG ACA ATT GGA 974
Thr Thr Gln Ala Pro Leu Lys Ala Leu Pro Ala Gly Ala Thr Ile Gly
265 270 275

TCA GGC AAT GGC CAA TAC ACC TAC CAC CCC AGC TCG GCA GTC TAT TAT 1022
Ser Gly Asn Gly Gln Tyr Thr Tyr His Pro Ser Ser Ala Val Tyr Tyr
280 285 290

TTA GCC GAT AGC ATC ATT GCT AAT GGC ATC ACC GCT TCT ATG ATT TTT 1070
Leu Ala Asp Ser Ile Ile Ala Asn Gly Ile Thr Ala Ser Met Ile Phe
295 300 305

TCA GGC ATG CAA AAT TTC GCC AAT AAA GCC GCT AAA CTG ACA GGC ACT 1118
Ser Gly Met Gln Asn Phe Ala Asn Lys Ala Ala Lys Leu Thr Gly Thr
310 315 320 325

TCA AGC TAT AGC CAG ATG CAA GAT GCG ATC AAT TAC GGG GAA AGC TTG 1166
Ser Ser Tyr Ser Gln Met Gln Asp Ala Ile Asn Tyr Gly Glu Ser Leu
330 335 340

CTC AGT AAC ACC GTA GCG TAT GGG GAT TTC ATC ACC AAT TGG GTC GCC 1214
Leu Ser Asn Thr Val Ala Tyr Gly Asp Phe Ile Thr Asn Trp Val Ala
345 350 355

CCC TAT TTG GAT TTA AAC AAC AAA GGT TTG AAT TTC TTG CCT AGC TAT 1262
Pro Tyr Leu Asp Leu Asn Asn Lys Gly Leu Asn Phe Leu Pro Ser Tyr
360 365 370

GGG GGG CAA TTG AAT GGT GCT AAT CAT CAA ACC CCA CAA TTA ACC CCG 1310
Gly Gly Gln Leu Asn Gly Ala Asn His Gln Thr Pro Gln Leu Thr Pro
375 380 385

CAA CAA GCC CAA CAA GAG CAA AAA GTC ATC ATG AAC CAA CTA GAG CAA 1358
Gln Gln Ala Gln Gln Glu Gln Lys Val Ile Met Asn Gln Leu Glu Gln
390 395 400 405

GCC ACA AAC GCC CCC ACC CCC GCG CAA ATA AAC AGG ATT TTA GCC AAC 1406
Ala Thr Asn Ala Pro Thr Pro Ala Gln Ile Asn Arg Ile Leu Ala Asn
410 415 420

CCC TAT TCC CCC ACG GCA AAA ACT TTA ATG GCT TAT GGG CTT TAT CGC 1454
Pro Tyr Ser Pro Thr Ala Lys Thr Leu Met Ala Tyr Gly Leu Tyr Arg
425 430 435

TCT AAA GCA GTG ATT GGC GGG GTG ATT GAT GAA ATG CAA ACT AAA GTG 1502
Ser Lys Ala Val Ile Gly Gly Val Ile Asp Glu Met Gln Thr Lys Val
440 445 450

AAT CAA GTC TAT CAA ATG GGC TTT GCT AGG AAT TTT TTG GAG CAT AAC 1550
Asn Gln Val Tyr Gln Met Gly Phe Ala Arg Asn Phe Leu Glu His Asn
455 460 465
TCT AAT TCT AAT AAC ATG AAC GGC TTT GGC GTG AAA ATG GGC TAT AAG 1598
Ser Asn Ser Asn Asn Met Asn Gly Phe Gly Val Lys Met Gly Tyr Lys
470 475 480 485

CAA TTC TTT GGC AAA AAG CGC ATG TTT GGG CTT AGG TAT TAT GGT TTT 1646
Gln Phe Phe Gly Lys Lys Arg Met Phe Gly Leu Arg Tyr Tyr Gly Phe
490 495 500

TAT GAT TTT GGT TAC GCT CAA TTT GGC GCA GAA TCT TCT TTA GTG AAA 1694
Tyr Asp Phe Gly Tyr Ala Gln Phe Gly Ala Glu Ser Ser Leu Val Lys
505 510 515

GCC ACC CTC TCT AGC TAT GGG GCA GGC ACA GAC TTT CTT TAT AAT GTT 1742
Ala Thr Leu Ser Ser Tyr Gly Ala Gly Thr Asp Phe Leu Tyr Asn Val
520 525 530

TTT ACC CGA AAA AGA GGG ACT GAA GCG ATA GAT ATC GGT TTT TTT GCC 1790
Phe Thr Arg Lys Arg Gly Thr Glu Ala Ile Asp Ile Gly Phe Phe Ala
535 540 545

GGT ATC CAA CTT GCA GGG CAA ACT TGG AAA ACG AAT TTT TTA GAT CAA 1838
Gly Ile Gln Leu Ala Gly Gln Thr Trp Lys Thr Asn Phe Leu Asp Gln
550 555 560 565

GTG GAT GGC AAC CAT CTT AAA CCC AAA GAC ACT TCT TTC CAA TTC CTT 1886
Val Asp Gly Asn His Leu Lys Pro Lys Asp Thr Ser Phe Gln Phe Leu
570 575 580

TTT GAT TTA GGC ATA AGG ACC AAT TTT TCC AAA ATC GCT CAT CAA AAA 1934
Phe Asp Leu Gly Ile Arg Thr Asn Phe Ser Lys Ile Ala His Gln Lys
585 590 595

AGA TCC CGT TTT TCT CAA GGG ATA GAA TTT GGC CTT AAA ATA CCG GTG 1982
Arg Ser Arg Phe Ser Gln Gly Ile Glu Phe Gly Leu Lys Ile Pro Val
600 605 610

CTT TAT CAC ACC TAT TAC CAA TCA GAA GGC GTT ACA GCG AAG TAT AGA 2030
Leu Tyr His Thr Tyr Tyr Gln Ser Glu Gly Val Thr Ala Lys Tyr Arg
615 620 625

AGA GCC TTT AGT TTT TAT GTG GGC TAC AAC ATA GGC TTT TGATTAAACA AA 2081
Arg Ala Phe Ser Phe Tyr Val Gly Tyr Asn Ile Gly Phe
630 635 640

ATAAGGGAAA AATATGATAA AAAAAGCTAG AAAATTCATA CCATTCTTTT TAATTGGCTC 2141

CCCTCTTAGCT GAAGACAATG GCTGGTATAT GTCTGTAGGC TATC 2185

(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 663 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal
(ix) FEATURE:

(A) NAME/KEY: Signal Sequence

(B) LOCATION: 1...21
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

Met Gly Val Ser Tyr Gln Thr Ser Leu Ala Ile Gln Arg Val Asp Asn
-20 -15 -10

Ser Gly Leu Asn Ala Ser Gln Ala Ala Ser Thr Tyr Ile Arg Gln Asn
-5 1 5 10

Ala Ile Ala Leu Glu Ser Ala Ala Val Pro Leu Ala Tyr Tyr Leu Glu
15 20 25

Ala Met Gly Gln Gln Thr Arg Val Leu Met Gln Met Leu Cys Pro Asp
30 35 40

Pro Ser Lys Arg Cys Leu Leu Tyr Ala Gly Gly Tyr Lys Asn Gly Ser
45 50 55

Ser Asn Thr Asn Gly Asp Thr Gly Asn Asn Pro Pro Arg Gly Asn Val
60 65 70 75

Asn Ala Thr Phe Asp Met Gln Ser Leu Val Asn Asn Leu Asn Lys Leu
80 85 90

Thr Gln Leu Ile Gly Glu Thr Leu Ile Arg Asn Pro Glu Asn Leu Ser
95 100 105

Asn Ala Lys Val Phe Asn Val Lys Phe Gly Asn Gln Ser Thr Val Ile
110 115 120

Ala Leu Pro Glu Gly Leu Ala Asn Thr Met Asn Ala Leu Asn Asp Asp
125 130 135

Ile Thr Asn Ala Leu Thr Thr Leu Trp Tyr Asn Gln Thr Leu Thr Asn
140 145 150 155

Lys Ser Phe Asn Ser Gly Asn Ser Val Asn Phe Ser Pro Gln Val Leu
160 165 170

Gln His Leu Leu Gln Asp Gly Leu Ala Thr Ser Asn Gln Thr Ile Cys
175 180 185

Ser Thr Gln Asn Gln Cys Thr Ala Thr Asn Glu Ala Lys Ser Ile Ala
190 195 200

Gln Asn Ala Gln Asn Ile Phe Gln Ala Leu Met Gln Ala Gly Ile Leu
205 210 215

Gly Gly Leu Ala Asn Glu Lys Gln Phe Gly Phe Thr Tyr Asn Lys Ala
220 225 230 235

Pro Asn Gly Ser Asp Ser Gln Gln Gly Tyr Gln Ser Phe Ser Gly Pro
240 245 250

Gly Tyr Tyr Thr Lys Asn Gly Ala Asn Gly Thr Thr Gln Ala Pro Leu
255 260 265

Lys Ala Leu Pro Ala Gly Ala Thr Ile Gly Ser Gly Asn Gly Gln Tyr
270 275 280

Thr Tyr His Pro Ser Ser Ala Val Tyr Tyr Leu Ala Asp Ser Ile Ile
285 290 295

Ala Asn Gly Ile Thr Ala Ser Met Ile Phe Ser Gly Met Gln Asn Phe
300 305 310 315

Ala Asn Lys Ala Ala Lys Leu Thr Gly Thr Ser Ser Tyr Ser Gln Met
320 325 330

Gln Asp Ala Ile Asn Tyr Gly Glu Ser Leu Leu Ser Asn Thr Val Ala
335 340 345

Tyr Gly Asp Phe Ile Thr Asn Trp Val Ala Pro Tyr Leu Asp Leu Asn
350 355 360

Asn Lys Gly Leu Asn Phe Leu Pro Ser Tyr Gly Gly Gln Leu Asn Gly
365 370 375

Ala Asn His Gln Thr Pro Gln Leu Thr Pro Gln Gln Ala Gln Gln Glu
380 385 390 395

Gln Lys Val Ile Met Asn Gln Leu Glu Gln Ala Thr Asn Ala Pro Thr
400 405 410

Pro Ala Gln Ile Asn Arg Ile Leu Ala Asn Pro Tyr Ser Pro Thr Ala
415 420 425

Lys Thr Leu Met Ala Tyr Gly Leu Tyr Arg Ser Lys Ala Val Ile Gly
430 435 440

Gly Val Ile Asp Glu Met Gln Thr Lys Val Asn Gln Val Tyr Gln Met
445 450 455

Gly Phe Ala Arg Asn Phe Leu Glu His Asn Ser Asn Ser Asn Asn Met
460 465 470 475

Asn Gly Phe Gly Val Lys Met Gly Tyr Lys Gln Phe Phe Gly Lys Lys
480 485 490

Arg Met Phe Gly Leu Arg Tyr Tyr Gly Phe Tyr Asp Phe Gly Tyr Ala
495 500 505

Gln Phe Gly Ala Glu Ser Ser Leu Val Lys Ala Thr Leu Ser Ser Tyr
510 515 520

Gly Ala Gly Thr Asp Phe Leu Tyr Asn Val Phe Thr Arg Lys Arg Gly
525 530 535

Thr Glu Ala Ile Asp Ile Gly Phe Phe Ala Gly Ile Gln Leu Ala Gly
540 545 550 555

Gln Thr Trp Lys Thr Asn Phe Leu Asp Gln Val Asp Gly Asn His Leu
560 565 570

Lys Pro Lys Asp Thr Ser Phe Gln Phe Leu Phe Asp Leu Gly Ile Arg
575 580 585

Thr Asn Phe Ser Lys Ile Ala His Gln Lys Arg Ser Arg Phe Ser Gln
590 595 600

Gly Ile Glu Phe Gly Leu Lys Ile Pro Val Leu Tyr His Thr Tyr Tyr
605 610 615

Gln Ser Glu Gly Val Thr Ala Lys Tyr Arg Arg Ala Phe Ser Phe Tyr
620 625 630 635

Val Gly Tyr Asn Ile Gly Phe
640

(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1213 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 51...1160
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

ATTATTTTTA ATCTTGCATG AAATCTTAAA TATAGAATTA GTCCCTTTGG ATG GGA 56
Met Gly
1

TTT TCN CTC GCG CTA GGC TAT TTG TGT TTG TTT ATA TTC GTT TTA AGC 104
Phe Xaa Leu Ala Leu Gly Tyr Leu Cys Leu Phe Ile Phe Val Leu Ser
5 10 15

GCT TCT TTA ATC TCT GAA AAA GCC TTA TCC AAG CAG TAT TTG CAA ACC 152
Ala Ser Leu Ile Ser Glu Lys Ala Leu Ser Lys Gln Tyr Leu Gln Thr
20 25 30

GCT AAA GAT AAA ATC ACC TCT TTA AAG AAT TTA AAA GTC ATC GCC ATT 200
Ala Lys Asp Lys Ile Thr Ser Leu Lys Asn Leu Lys Val Ile Ala Ile
35 40 45 50

ACC GGA AGC TTT GGG AAA ACC AGC ACC AAA AAT TTC TTG CTT CAA ATC 248
Thr Gly Ser Phe Gly Lys Thr Ser Thr Lys Asn Phe Leu Leu Gln Ile
55 60 65

TTA CAA ACC ACA TTC AAC GCG CAT GCA AGC CCC AAA AGC GTC AAT ACC 296
Leu Gln Thr Thr Phe Asn Ala His Ala Ser Pro Lys Ser Val Asn Thr
70 75 80

CTT TTA GGG CTT GCG AAT GAT ATT AAT CAG AAT TTA GAC GAT AGG AGT 344
Leu Leu Gly Leu Ala Asn Asp Ile Asn Gln Asn Leu Asp Asp Arg Ser
85 90 95

GAA ATC TAT ATC GCT GAA GCC GGG GCA AGG AAT AAG GGC GAT ATT AAA 392
Glu Ile Tyr Ile Ala Glu Ala Gly Ala Arg Asn Lys Gly Asp Ile Lys
100 105 110

GAA ATC ACC TGT CTC ATT GAA CCG CAC CTT GTT GTG GTT GCA GAA GTG 440
Glu Ile Thr Cys Leu Ile Glu Pro His Leu Val Val Val Ala Glu Val
115 120 125 130

GGC GAA CAG CAT TTA GAA TAC TTT AAA ACT TTA GAA AAT ATT TGC GAG 488
Gly Glu Gln His Leu Glu Tyr Phe Lys Thr Leu Glu Asn Ile Cys Glu
135 140 145

ACT AAA GCG GAA TTA TTG GAT TCC AAA CGC TTA GAA AAA GCC TTT TGT 536
Thr Lys Ala Glu Leu Leu Asp Ser Lys Arg Leu Glu Lys Ala Phe Cys
150 155 160

TAC TCG GTG GAA AAG ATC AAG CCC TAT GCC CCT AAA GAT AGC CCT TTA 584
Tyr Ser Val Glu Lys Ile Lys Pro Tyr Ala Pro Lys Asp Ser Pro Leu
165 170 175

ATA GAC TAT TCT AGC CTG GTT AAA AAC ATC CAA TCC ACT TTA AAA GGC 632
Ile Asp Tyr Ser Ser Leu Val Lys Asn Ile Gln Ser Thr Leu Lys Gly
180 185 190

ACT TCT TTT GAA ATG CTT ATA GGT AGC GTT TGG GAA AGA TTT GAA ACA 680
Thr Ser Phe Glu Met Leu Ile Gly Ser Val Trp Glu Arg Phe Glu Thr
195 200 205 210

AAG GTT CTA GGG GAG TTT AGC GCT TAT AAT ATC GCT TCA GCC ATT TTA 728
Lys Val Leu Gly Glu Phe Ser Ala Tyr Asn Ile Ala Ser Ala Ile Leu
215 220 225

ATC GCT AAG CAT TTA GGC TTA GAG ACC GAA AGG ATC AAA CGG CTT GTT 776
Ile Ala Lys His Leu Gly Leu Glu Thr Glu Arg Ile Lys Arg Leu Val
230 235 240

TTA GAA CTC AAC CCT ATT GCT CAT CGT TTG CAA CTT TTG GAA GTG AAT 824
Leu Glu Leu Asn Pro Ile Ala His Arg Leu Gln Leu Leu Glu Val Asn
245 250 255

CAA AAA ATC ATC ATA GAC GAT AGC TTT AAT GGG AAT TTA AAG GGC ATG 872
Gln Lys Ile Ile Ile Asp Asp Ser Phe Asn Gly Asn Leu Lys Gly Met
260 265 270

TTA GAG GGC ATT CGT TTA GCG AGT TTG CAC AAA GGG CGT AAA GTC ATT 920
Leu Glu Gly Ile Arg Leu Ala Ser Leu His Lys Gly Arg Lys Val Ile
275 280 285 290

GTA ACA CCG GGC TTA GTG GAA AGC AAT ACA GAA AGT AAT GAG GCT TTA 968
Val Thr Pro Gly Leu Val Glu Ser Asn Thr Glu Ser Asn Glu Ala Leu
295 300 305

GCG CAA AAA ATA GAC GGG GTT TTT GAT GTC GCT ATC ATC ACA GGG GAG 1016
Ala Gln Lys Ile Asp Gly Val Phe Asp Val Ala Ile Ile Thr Gly Glu
310 315 320

TTG AAT TCC AAA ACG ATT GCT TCA CAA TTG AAA ACC CCC CAA AAA ATC 1064
Leu Asn Ser Lys Thr Ile Ala Ser Gln Leu Lys Thr Pro Gln Lys Ile
325 330 335

TTA CTC AAG GAT AAG GCG CAA TTG GAA AAT ATC TTA CAA GCC ACC ACG 1112
Leu Leu Lys Asp Lys Ala Gln Leu Glu Asn Ile Leu Gln Ala Thr Thr
340 345 350

ATT CAA GGC GAT TTG ATT TTA TTC GCT AAT GAC GCC CCT AAT TAC ATT T 1161
Ile Gln Gly Asp Leu Ile Leu Phe Ala Asn Asp Ala Pro Asn Tyr Ile
355 360 365 370

AGGAAATGAA CATGCAACAT TTATACGCTC CTTGGCGCGA AAGTTATTTG AA 1213

(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 370 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:

Met Gly Phe Xaa Leu Ala Leu Gly Tyr Leu Cys Leu Phe Ile Phe Val
1 5 10 15

Leu Ser Ala Ser Leu Ile Ser Glu Lys Ala Leu Ser Lys Gln Tyr Leu
20 25 30

Gln Thr Ala Lys Asp Lys Ile Thr Ser Leu Lys Asn Leu Lys Val Ile
35 40 45

Ala Ile Thr Gly Ser Phe Gly Lys Thr Ser Thr Lys Asn Phe Leu Leu
50 55 60

Gln Ile Leu Gln Thr Thr Phe Asn Ala His Ala Ser Pro Lys Ser Val
65 70 75 80

Asn Thr Leu Leu Gly Leu Ala Asn Asp Ile Asn Gln Asn Leu Asp Asp
85 90 95

Arg Ser Glu Ile Tyr Ile Ala Glu Ala Gly Ala Arg Asn Lys Gly Asp
100 105 110

Ile Lys Glu Ile Thr Cys Leu Ile Glu Pro His Leu Val Val Val Ala
115 120 125

Glu Val Gly Glu Gln His Leu Glu Tyr Phe Lys Thr Leu Glu Asn Ile
130 135 140

Cys Glu Thr Lys Ala Glu Leu Leu Asp Ser Lys Arg Leu Glu Lys Ala
145 150 155 160

Phe Cys Tyr Ser Val Glu Lys Ile Lys Pro Tyr Ala Pro Lys Asp Ser
165 170 175

Pro Leu Ile Asp Tyr Ser Ser Leu Val Lys Asn Ile Gln Ser Thr Leu
180 185 190

Lys Gly Thr Ser Phe Glu Met Leu Ile Gly Ser Val Trp Glu Arg Phe
195 200 205

Glu Thr Lys Val Leu Gly Glu Phe Ser Ala Tyr Asn Ile Ala Ser Ala
210 215 220

Ile Leu Ile Ala Lys His Leu Gly Leu Glu Thr Glu Arg Ile Lys Arg
225 230 235 240

Leu Val Leu Glu Leu Asn Pro Ile Ala His Arg Leu Gln Leu Leu Glu
245 250 255

Val Asn Gln Lys Ile Ile Ile Asp Asp Ser Phe Asn Gly Asn Leu Lys
260 265 270

Gly Met Leu Glu Gly Ile Arg Leu Ala Ser Leu His Lys Gly Arg Lys
275 280 285

Val Ile Val Thr Pro Gly Leu Val Glu Ser Asn Thr Glu Ser Asn Glu
290 295 300

Ala Leu Ala Gln Lys Ile Asp Gly Val Phe Asp Val Ala Ile Ile Thr
305 310 315 320

Gly Glu Leu Asn Ser Lys Thr Ile Ala Ser Gln Leu Lys Thr Pro Gln
325 330 335

Lys Ile Leu Leu Lys Asp Lys Ala Gln Leu Glu Asn Ile Leu Gln Ala
340 345 350

Thr Thr Ile Gln Gly Asp Leu Ile Leu Phe Ala Asn Asp Ala Pro Asn
355 360 365

Tyr Ile
370

(2) INFORMATION FOR SEQ ID NO: 141:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 360 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 82...270
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

ACTTAAAGGC ATAAAAACCT TAAGCTTTTT GAGTTTCAAA AGGGTTTCAA GCTTTTTATA 60

AGACTTTTTT TGAATGAGTA A GGA GAA AAT ATT TTG TTC CAT AAA CTG ATC 111
Gly Glu Asn Ile Leu Phe His Lys Leu Ile
1 5 10

TTA ACA TGC TTT TTA GCG CTT GTA GCA ATA ACC ATT CAA GCT TGC GGT 159
Leu Thr Cys Phe Leu Ala Leu Val Ala Ile Thr Ile Gln Ala Cys Gly
15 20 25

TAT AAA GCC CCT CCA TTC AAT GAA AAA CCC GCT AAA AAA ACT TCA AAC 207
Tyr Lys Ala Pro Pro Phe Asn Glu Lys Pro Ala Lys Lys Thr Ser Asn
30 35 40

AGC TCT AAT TCT TCT ATG CAA ACG CCC ACC AAC AGC ACC ACG CCA GAA 255
Ser Ser Asn Ser Ser Met Gln Thr Pro Thr Asn Ser Thr Thr Pro Glu
45 50 55

TTT TTA AAT CAG CCT TAAAATCACT GCTCTTGTTT AAGGGCTTTG ATTTCTAGGG T 311
Phe Leu Asn Gln Pro
60

TTTTGTGGCT AACTTTTGAN STTCGCTTTC ATCATGCGTT ACCATAATG 360

(2) INFORMATION FOR SEQ ID NO: 142:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 63 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:

Gly Glu Asn Ile Leu Phe His Lys Leu Ile Leu Thr Cys Phe Leu Ala
1 5 10 15

Leu Val Ala Ile Thr Ile Gln Ala Cys Gly Tyr Lys Ala Pro Pro Phe
20 25 30

Asn Glu Lys Pro Ala Lys Lys Thr Ser Asn Ser Ser Asn Ser Ser Met
35 40 45

Gln Thr Pro Thr Asn Ser Thr Thr Pro Glu Phe Leu Asn Gln Pro
50 55 60

(2) INFORMATION FOR SEQ ID NO: 143:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1024 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 115...921
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:

AGTTGGCAAA AACGCAGAGA CAGTAACGCA AAGGCAAATA AAGAGACTCA TTTTAAACAA 60

GCGAATGCCA TTACAAATAT AATCAGATCA GTTGGTGGGT TTTTTACAAA GATT ATG 117
Met
1

AAG AGA GTT AGA GAA CTT GTA AAA AAA CAT CCC GAG AAA AGC AGT GTG 165
Lys Arg Val Arg Glu Leu Val Lys Lys His Pro Glu Lys Ser Ser Val
5 10 15

GCA TTA GTA GTA TTA ACC CAT GCT GCA TGC AAG AAA GCG AAA GAA TTG 213
Ala Leu Val Val Leu Thr His Ala Ala Cys Lys Lys Ala Lys Glu Leu
20 25 30

GAC GAT AAA GTC CAG GAT AAA TCC AAA CAA GCT GAA AAA GAA AAT CAA 261
Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu Lys Glu Asn Gln
35 40 45

ATC AAT TGG TGG AAA TAT TCA GGA TTA ACA ATA GCG ACA AGT TTA TTA 309
Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala Thr Ser Leu Leu
50 55 60 65

TTA GCC GCT TGT AGT GTT GGT GAT ATT GAT AAA CAG ATA GAG TTA GAA 357
Leu Ala Ala Cys Ser Val Gly Asp Ile Asp Lys Gln Ile Glu Leu Glu
70 75 80

CAA GAA AAA AAG GAA GCT GAA AAC GCT AGG GAT AGA GCG AAC AAG AGT 405
Gln Glu Lys Lys Glu Ala Glu Asn Ala Arg Asp Arg Ala Asn Lys Ser
85 90 95

GGG ATA GAA CTG GAA CAG GAA AAA CAA AAG ACC ATT AAA GAA CAA AAA 453
Gly Ile Glu Leu Glu Gln Glu Lys Gln Lys Thr Ile Lys Glu Gln Lys
100 105 110

GAT TTA GTT AAA AAA GCA GAA CAA AAT TGC CAA GAA AAT CAT GGC CAA 501
Asp Leu Val Lys Lys Ala Glu Gln Asn Cys Gln Glu Asn His Gly Gln
115 120 125

TTC TTT ATG AAA AAA TTA GGA ATT AAG GGT GGC ATT GCT ATA GAA GTA 549
Phe Phe Met Lys Lys Leu Gly Ile Lys Gly Gly Ile Ala Ile Glu Val
130 135 140 145

GAA GCT GAA TGC AAA ACC CCT AAA CCT GCA AAA ACC AAT CAA ACC CCT 597
Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr Asn Gln Thr Pro
150 155 160

ATC CAG CCA AAA CAC CTC CCC AAC TCT AAA CAA CCC CAC TCT CAA AGA 645
Ile Gln Pro Lys His Leu Pro Asn Ser Lys Gln Pro His Ser Gln Arg
165 170 175

GGA TCA AAA GCG CAA GAG CTT ATC GCT TAT TTG CAA AAA GAG TTA GAA 693
Gly Ser Lys Ala Gln Glu Leu Ile Ala Tyr Leu Gln Lys Glu Leu Glu
180 185 190

TCT CTG CCC TAT TCA CAA AAA GCT ATC GCT AAA CAA GTG AAT TTT TAC 741
Ser Leu Pro Tyr Ser Gln Lys Ala Ile Ala Lys Gln Val Asn Phe Tyr
195 200 205

AGG CCA AGT TCT GTC GCT TAT TTA GAA CTA GAC CCT AGA GAT TTT AAG 789
Arg Pro Ser Ser Val Ala Tyr Leu Glu Leu Asp Pro Arg Asp Phe Lys
210 215 220 225

GTT ACA GAA GAA TGG CAA AAA GAA AAT CTA AAA ATA CGC TCT AAA GCT 837
Val Thr Glu Glu Trp Gln Lys Glu Asn Leu Lys Ile Arg Ser Lys Ala
230 235 240

CAA GCT AAA ATG CTT GGA AAT GAG AAA CCC ACA AGC CCA CCT TTC AAC 885
Gln Ala Lys Met Leu Gly Asn Glu Lys Pro Thr Ser Pro Pro Phe Asn
245 250 255

CTC TCA AAG CCT TTT GTT CGT TCA AAA AAT ATT TGC TGATGTTAAT AAAGAA 937
Leu Ser Lys Pro Phe Val Arg Ser Lys Asn Ile Cys
260 265

ATAGAAGCAG TTGCTAATAC TGAAAAGAAA GCAGAAAAAG MGGGTTATGG TTATAGTAAA 997

AGGATGTAGG CATAAGAAAA TAAGAAC 1024

(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 269 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

Met Lys Arg Val Arg Glu Leu Val Lys Lys His Pro Glu Lys Ser Ser
1 5 10 15

Val Ala Leu Val Val Leu Thr His Ala Ala Cys Lys Lys Ala Lys Glu
20 25 30

Leu Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu Lys Glu Asn
35 40 45

Gln Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala Thr Ser Leu
50 55 60

Leu Leu Ala Ala Cys Ser Val Gly Asp Ile Asp Lys Gln Ile Glu Leu
65 70 75 80

Glu Gln Glu Lys Lys Glu Ala Glu Asn Ala Arg Asp Arg Ala Asn Lys
85 90 95

Ser Gly Ile Glu Leu Glu Gln Glu Lys Gln Lys Thr Ile Lys Glu Gln
100 105 110

Lys Asp Leu Val Lys Lys Ala Glu Gln Asn Cys Gln Glu Asn His Gly
115 120 125

Gln Phe Phe Met Lys Lys Leu Gly Ile Lys Gly Gly Ile Ala Ile Glu
130 135 140

Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr Asn Gln Thr
145 150 155 160

Pro Ile Gln Pro Lys His Leu Pro Asn Ser Lys Gln Pro His Ser Gln
165 170 175

Arg Gly Ser Lys Ala Gln Glu Leu Ile Ala Tyr Leu Gln Lys Glu Leu
180 185 190

Glu Ser Leu Pro Tyr Ser Gln Lys Ala Ile Ala Lys Gln Val Asn Phe
195 200 205

Tyr Arg Pro Ser Ser Val Ala Tyr Leu Glu Leu Asp Pro Arg Asp Phe
210 215 220

Lys Val Thr Glu Glu Trp Gln Lys Glu Asn Leu Lys Ile Arg Ser Lys
225 230 235 240

Ala Gln Ala Lys Met Leu Gly Asn Glu Lys Pro Thr Ser Pro Pro Phe
245 250 255

Asn Leu Ser Lys Pro Phe Val Arg Ser Lys Asn Ile Cys
260 265

(2) INFORMATION FOR SEQ ID NO: 145:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 669 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 88...603
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:

AAAATAAGGA GGAATTGTTT GATTTTACGA TTGGCTGGAG CAAGCGTTTT AACGGCTTGT 60

GTCTTTTCGG GGTGTTTTTT TTTAAAA ATG TTT GAT AAA AAA CTT TCT AGT AAC 114
Met Phe Asp Lys Lys Leu Ser Ser Asn
1 5

GAT TGG CAT ATC CAA AAA GTG GAA ATG AAC CAT CAA GTC TAT GAC ATT 162
Asp Trp His Ile Gln Lys Val Glu Met Asn His Gln Val Tyr Asp Ile
10 15 20 25

GAA ACC ATG CTC GCT GAT AGC GCT TTT AGA GAG CAT GAA GAA GAG CAA 210
Glu Thr Met Leu Ala Asp Ser Ala Phe Arg Glu His Glu Glu Glu Gln
30 35 40

GAT TCC TCT CTA AAT ACC GCT TTG CCT GAA GAT AAA ACA GCG ATT GAA 258
Asp Ser Ser Leu Asn Thr Ala Leu Pro Glu Asp Lys Thr Ala Ile Glu
45 50 55

GCC AAA GAG CAA GAG CAA AAA GAA AAA AGA AAA CGC TGG TAT GAG CTT 306
Ala Lys Glu Gln Glu Gln Lys Glu Lys Arg Lys Arg Trp Tyr Glu Leu
60 65 70

TTT AAA AAG AAA CCA AAG CCC AAA AGC TCT ATG GGA GAG TTT GTG TTT 354
Phe Lys Lys Lys Pro Lys Pro Lys Ser Ser Met Gly Glu Phe Val Phe
75 80 85

GAT CAA AAA GAA AAT CGT ATT TAT GGC AAA GGC TAT TGC AAC CGG TAT 402
Asp Gln Lys Glu Asn Arg Ile Tyr Gly Lys Gly Tyr Cys Asn Arg Tyr
90 95 100 105

TTT GCC AGC TAT GTA TGG CAG GGC GAT AGG CAC ATT GGG ATT GAA GAT 450
Phe Ala Ser Tyr Val Trp Gln Gly Asp Arg His Ile Gly Ile Glu Asp
110 115 120

AGC GGG ATT TCA AGA AAA GTG TGT AAA GAT GAG CAT TTA ATG GCG TTT 498
Ser Gly Ile Ser Arg Lys Val Cys Lys Asp Glu His Leu Met Ala Phe
125 130 135

GAA TTG GAA TTT ATG GAG AAT TTT AAG GGT AAT TTT ACG GTA ACT AAG 546
Glu Leu Glu Phe Met Glu Asn Phe Lys Gly Asn Phe Thr Val Thr Lys
140 145 150

GGC AAG GAC ACG CTC ATT TTA GAC AAC CAA AAA ATG AAA ATT TAT TTG 594
Gly Lys Asp Thr Leu Ile Leu Asp Asn Gln Lys Met Lys Ile Tyr Leu
155 160 165

AAA ACG CCT TGAGTGGGTT TTTGATTTCA AAACAATCTA AGATCACTAA ATTAGGGAT 652
Lys Thr Pro
170

TAAAAAGAAA TTTTTAA 669

(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 172 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:

Met Phe Asp Lys Lys Leu Ser Ser Asn Asp Trp His Ile Gln Lys Val
1 5 10 15

Glu Met Asn His Gln Val Tyr Asp Ile Glu Thr Met Leu Ala Asp Ser
20 25 30

Ala Phe Arg Glu His Glu Glu Glu Gln Asp Ser Ser Leu Asn Thr Ala
35 40 45

Leu Pro Glu Asp Lys Thr Ala Ile Glu Ala Lys Glu Gln Glu Gln Lys
50 55 60

Glu Lys Arg Lys Arg Trp Tyr Glu Leu Phe Lys Lys Lys Pro Lys Pro
65 70 75 80

Lys Ser Ser Met Gly Glu Phe Val Phe Asp Gln Lys Glu Asn Arg Ile
85 90 95

Tyr Gly Lys Gly Tyr Cys Asn Arg Tyr Phe Ala Ser Tyr Val Trp Gln
100 105 110

Gly Asp Arg His Ile Gly Ile Glu Asp Ser Gly Ile Ser Arg Lys Val
115 120 125

Cys Lys Asp Glu His Leu Met Ala Phe Glu Leu Glu Phe Met Glu Asn
130 135 140

Phe Lys Gly Asn Phe Thr Val Thr Lys Gly Lys Asp Thr Leu Ile Leu
145 150 155 160

Asp Asn Gln Lys Met Lys Ile Tyr Leu Lys Thr Pro
165 170

(2) INFORMATION FOR SEQ ID NO: 147:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1350 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 87...1280
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:

ATCAATCTAA CTTGAGTGGA TTTTTCGTAT TAGTTTCCAT GATATAATTT TGAAAAGTAA 60

GATTGTTTTT TAAAAAAAGG TTGGTA ATG GAA TCA GTA AAA ACA GGA AAA ACA 113
Met Glu Ser Val Lys Thr Gly Lys Thr
1 5

AAT AAG GTT GGC AAG AAT ACA GAG ATG GCT AAT ACA AAG GCA AAT AAA 161
Asn Lys Val Gly Lys Asn Thr Glu Met Ala Asn Thr Lys Ala Asn Lys
10 15 20 25

GAG ACT CAT TTT AAA CAA GTG AGC GCC ATT ACA AAT ATA ATC AGA TCA 209
Glu Thr His Phe Lys Gln Val Ser Ala Ile Thr Asn Ile Ile Arg Ser
30 35 40

GTT GGT GGG TTT TTT ACA AAA ATT GCA AAG AGA GTT AGA GGA CTT GTA 257
Val Gly Gly Phe Phe Thr Lys Ile Ala Lys Arg Val Arg Gly Leu Val
45 50 55

AAA AAA CAC CCC AAG AAA AGC AGT GCG GCA TTA GTA GTA TTG ACC CAT 305
Lys Lys His Pro Lys Lys Ser Ser Ala Ala Leu Val Val Leu Thr His
60 65 70

ATT GCG TGC AAG AAA GCG AAA GAA TTA GAC GAT AAA GTC CAA GAT AAA 353
Ile Ala Cys Lys Lys Ala Lys Glu Leu Asp Asp Lys Val Gln Asp Lys
75 80 85

TCC AAA CAA GCT GAA AAA GAA AAT CAA ATC AAT TGG TGG AAA TAT TCA 401
Ser Lys Gln Ala Glu Lys Glu Asn Gln Ile Asn Trp Trp Lys Tyr Ser
90 95 100 105

GGA TTA ACA ATA GCG GCA AGT TTA TTA TTA GCC GCT TGT AGC GCT GGT 449
Gly Leu Thr Ile Ala Ala Ser Leu Leu Leu Ala Ala Cys Ser Ala Gly
110 115 120

GAT ACT GAT AAA CAG ATA GAA CTA GAA CAA GAA AAA AAG GAA GCT GAA 497
Asp Thr Asp Lys Gln Ile Glu Leu Glu Gln Glu Lys Lys Glu Ala Glu
125 130 135

AAC GCT AGG GAT AGA GCG AAC AAG AGT GGG ATA GAA CTA GAA CAA GAA 545
Asn Ala Arg Asp Arg Ala Asn Lys Ser Gly Ile Glu Leu Glu Gln Glu
140 145 150

AGA CAG AAA ACA AAC AAG AGT GGG ATA GAA CTC GCT AAT AGT CAA ATA 593
Arg Gln Lys Thr Asn Lys Ser Gly Ile Glu Leu Ala Asn Ser Gln Ile
155 160 165

AAA GCA GAA CAA GAA AGA CAA AAG ACA GAA CAA GAA AAA CAA AAA GCA 641
Lys Ala Glu Gln Glu Arg Gln Lys Thr Glu Gln Glu Lys Gln Lys Ala
170 175 180 185

AAT AAG AGT GCG ATA GAG TTA GAA CAG CAA AAA CAA AAG ACC ATT AAT 689
Asn Lys Ser Ala Ile Glu Leu Glu Gln Gln Lys Gln Lys Thr Ile Asn
190 195 200

ACA CAA AGA GAT TTG ATT AAA GAA CAG AAA GAT TTC ATT AAA GAA ACA 737
Thr Gln Arg Asp Leu Ile Lys Glu Gln Lys Asp Phe Ile Lys Glu Thr
205 210 215

GAA CAA AAT TGC CAA GAA AAT CAT AAT CAA TTC TTT ATT AAA AAA TTA 785
Glu Gln Asn Cys Gln Glu Asn His Asn Gln Phe Phe Ile Lys Lys Leu
220 225 230

GGA ATT AAG GGT GGC ATT GCT ATA GAA GTA GAA GCT GAA TGC AAA ACC 833
Gly Ile Lys Gly Gly Ile Ala Ile Glu Val Glu Ala Glu Cys Lys Thr
235 240 245

CCT AAA CCT GCA AAA ACC AAT CAA ACC CCT ATC CAG CCA AAA CAC CTC 881
Pro Lys Pro Ala Lys Thr Asn Gln Thr Pro Ile Gln Pro Lys His Leu
250 255 260 265

CCA AAC TCT AAA CAA CCT CAT TCT CAA AGA GGA TCA AAA GCG CAA GAG 929
Pro Asn Ser Lys Gln Pro His Ser Gln Arg Gly Ser Lys Ala Gln Glu
270 275 280

TTT ATC GCT TAT TTG CAA AAA GAG CTA GAA TTT CTG CCC TAT TCG CAA 977
Phe Ile Ala Tyr Leu Gln Lys Glu Leu Glu Phe Leu Pro Tyr Ser Gln
285 290 295

AAA GCT ATC GCT AAA CAA GTG AAT TTC TAT AAA CCA AGT TCT ATC GCT 1025
Lys Ala Ile Ala Lys Gln Val Asn Phe Tyr Lys Pro Ser Ser Ile Ala
300 305 310

TAT TTA GAA CTA GAT CCT AGA GAT TTT AAG GTT ACA GAA GAA TGG CAA 1073
Tyr Leu Glu Leu Asp Pro Arg Asp Phe Lys Val Thr Glu Glu Trp Gln
315 320 325

AAA GAA AAT CTA AAA ATA CGC TCT AAA GCT CAA GCT AAA ATG CTT GAA 1121
Lys Glu Asn Leu Lys Ile Arg Ser Lys Ala Gln Ala Lys Met Leu Glu
330 335 340 345

ATG AGG GAT TTA AAA CCA GAC CCA CAA GCC CAC CTT CCA ACC TCT CAA 1169
Met Arg Asp Leu Lys Pro Asp Pro Gln Ala His Leu Pro Thr Ser Gln
350 355 360

AGC CTT TTG TTC GTT CAA AAA ATA TTT GCT GAT GTT AAT AAA GAA ATA 1217
Ser Leu Leu Phe Val Gln Lys Ile Phe Ala Asp Val Asn Lys Glu Ile
365 370 375

GAA GCA GTT GCT AAT ACT GAA AAG AAA GCA GAA AAA GCG GGT TAT GGT 1265
Glu Ala Val Ala Asn Thr Glu Lys Lys Ala Glu Lys Ala Gly Tyr Gly
380 385 390

TAT AGT AAA AGG ATG TAGGCATAAG AAAATAAGAA CACCATAAAA TCGTTTTTAG C 1321
Tyr Ser Lys Arg Met
395

TTCTAGGAGA CATCAGTCAG TTTCTTGCC 1350

(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 398 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:

Met Glu Ser Val Lys Thr Gly Lys Thr Asn Lys Val Gly Lys Asn Thr
1 5 10 15

Glu Met Ala Asn Thr Lys Ala Asn Lys Glu Thr His Phe Lys Gln Val
20 25 30

Ser Ala Ile Thr Asn Ile Ile Arg Ser Val Gly Gly Phe Phe Thr Lys
35 40 45

Ile Ala Lys Arg Val Arg Gly Leu Val Lys Lys His Pro Lys Lys Ser
50 55 60

Ser Ala Ala Leu Val Val Leu Thr His Ile Ala Cys Lys Lys Ala Lys
65 70 75 80

Glu Leu Asp Asp Lys Val Gln Asp Lys Ser Lys Gln Ala Glu Lys Glu
85 90 95

Asn Gln Ile Asn Trp Trp Lys Tyr Ser Gly Leu Thr Ile Ala Ala Ser
100 105 110

Leu Leu Leu Ala Ala Cys Ser Ala Gly Asp Thr Asp Lys Gln Ile Glu
115 120 125

Leu Glu Gln Glu Lys Lys Glu Ala Glu Asn Ala Arg Asp Arg Ala Asn
130 135 140

Lys Ser Gly Ile Glu Leu Glu Gln Glu Arg Gln Lys Thr Asn Lys Ser
145 150 155 160

Gly Ile Glu Leu Ala Asn Ser Gln Ile Lys Ala Glu Gln Glu Arg Gln
165 170 175

Lys Thr Glu Gln Glu Lys Gln Lys Ala Asn Lys Ser Ala Ile Glu Leu
180 185 190

Glu Gln Gln Lys Gln Lys Thr Ile Asn Thr Gln Arg Asp Leu Ile Lys
195 200 205

Glu Gln Lys Asp Phe Ile Lys Glu Thr Glu Gln Asn Cys Gln Glu Asn
210 215 220

His Asn Gln Phe Phe Ile Lys Lys Leu Gly Ile Lys Gly Gly Ile Ala
225 230 235 240

Ile Glu Val Glu Ala Glu Cys Lys Thr Pro Lys Pro Ala Lys Thr Asn
245 250 255

Gln Thr Pro Ile Gln Pro Lys His Leu Pro Asn Ser Lys Gln Pro His
260 265 270

Ser Gln Arg Gly Ser Lys Ala Gln Glu Phe Ile Ala Tyr Leu Gln Lys
275 280 285

Glu Leu Glu Phe Leu Pro Tyr Ser Gln Lys Ala Ile Ala Lys Gln Val
290 295 300

Asn Phe Tyr Lys Pro Ser Ser Ile Ala Tyr Leu Glu Leu Asp Pro Arg
305 310 315 320

Asp Phe Lys Val Thr Glu Glu Trp Gln Lys Glu Asn Leu Lys Ile Arg
325 330 335

Ser Lys Ala Gln Ala Lys Met Leu Glu Met Arg Asp Leu Lys Pro Asp
340 345 350

Pro Gln Ala His Leu Pro Thr Ser Gln Ser Leu Leu Phe Val Gln Lys
355 360 365

Ile Phe Ala Asp Val Asn Lys Glu Ile Glu Ala Val Ala Asn Thr Glu
370 375 380

Lys Lys Ala Glu Lys Ala Gly Tyr Gly Tyr Ser Lys Arg Met
385 390 395



(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 709 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 336...443
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:

TAAGGGATAT TGCTAACGAT TAAGCTGTAT TGGAAGAGTT TATTTTGCAA GAATTAATCT 60

TGCCTTGTGT GATTAGTAAC ACAAGGCAAG TGTGATAAAC CCTACTACAA TTTCAATTCA 120

AGGAGCCTAA CTAAAATAAA ATGAACAATT TCAGTTAGGG CTTTATTATA GCAAAAATTA 180

TCTAAGATTA CAAAGGGTAG CGTTTCTGTT TTTGGATTTA GAGCGTTATT TTGATTGTTT 240

TGAGTTTAAT TTACTTTTTG TTTAATAATA AATCTTAACT ATCATAAATG TACAATTAAA 300

GTATTTAAAA AAATTTTAAA ACAAAAGGAT ATAAA ATG AAA ACC ATT AGA AAT 353
Met Lys Thr Ile Arg Asn
1 5

AGC GTG TTT ATT GGA GCG TCT TTA CTC GGC GGT TGC GCT AGC GTT GAG 401
Ser Val Phe Ile Gly Ala Ser Leu Leu Gly Gly Cys Ala Ser Val Glu
10 15 20

GCT TAT TTT GAC GCT TTG CAT GTT GCT CGC GTT AAA GAC GCT TGTTTATAG 452
Ala Tyr Phe Asp Ala Leu His Val Ala Arg Val Lys Asp Ala
25 30 35

AAAAAGAAGC ACACCACACG CCCAAAGACT TTGATAGCCC TTACCACACT GACTAAACCG 512

GCACTAGGTT TTAGTTGGGG GTTTTTAGGG GTGTTATTTT AGATACTCTC TGTTCCCTTA 572

AAGAAAATAA ATTTCTACCA TAAAATAAAA TCTTAAATTA AGGCGACTAA AACCCCACTT 632

TTAAAAAATT AAAAAGCGTT AAGTAAGACT TATCCAAAAA GCAAAGAAAA TCAATTTTTC 692

CAACCACTTT TTTTAAG 709

(2) INFORMATION FOR SEQ ID NO: 150:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(v) FRAGMENT TYPE: internal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150:

Met Lys Thr Ile Arg Asn Ser Val Phe Ile Gly Ala Ser Leu Leu Gly
1 5 10 15

Gly Cys Ala Ser Val Glu Ala Tyr Phe Asp Ala Leu His Val Ala Arg
20 25 30

Val Lys Asp Ala
35

(2) INFORMATION FOR SEQ ID NO: 151: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(ix) FEATURE: 

(A) NAME /KEY : Coding Sequence 

(B) LOCATION: 19 . . . 837 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

AGATAGGAAT GTAAAGGA ATG GAA TTT ATG AAA AAG TTT GTA GCT TTA GGG 51
Met Glu Phe Met Lys Lys Phe Val Ala Leu Gly
1 5 10

CTT CTA TCC GCA GTT TTA AGC TCT TCG TTG TTA GCC GAA GGT GAT GGT 99
Leu Leu Ser Ala Val Leu Ser Ser Ser Leu Leu Ala Glu Gly Asp Gly
15 20 25

GTT TAT ATA GGG ACT AAT TAT CAG CTT GGA CAA GCC CGT TTG AAT AGT 147
Val Tyr Ile Gly Thr Asn Tyr Gln Leu Gly Gln Ala Arg Leu Asn Ser
30 35 40

AAT ATT TAT AAT ACA GGG GAT TGC ACA GGG AGT GTT GTA GGT TGC CCC 195
Asn Ile Tyr Asn Thr Gly Asp Cys Thr Gly Ser Val Val Gly Cys Pro
45 50 55

CCA GGT CTT ACC GCT AAT AAG CAT AAT CCA GGA GGC ACC AAT ATC AAT 243
Pro Gly Leu Thr Ala Asn Lys His Asn Pro Gly Gly Thr Asn Ile Asn
60 65 70 75

TGG CAT GCT AAA TAC GCT AAT GGG GCT TTG AAT GGT CTT GGG TTG AAT 291
Trp His Ala Lys Tyr Ala Asn Gly Ala Leu Asn Gly Leu Gly Leu Asn
80 85 90

GTG GGT TAT AAG AAG TTC TTC CAG TTC AAG TCT TTT GAT ATG ACA AGC 339
Val Gly Tyr Lys Lys Phe Phe Gln Phe Lys Ser Phe Asp Met Thr Ser
95 100 105

AAG TGG TTT GGT TTT AGA GTG TAT GGG CTT TTT GAT TAT GGG CAT GCC 387
Lys Trp Phe Gly Phe Arg Val Tyr Gly Leu Phe Asp Tyr Gly His Ala
110 115 120

ACT TTA GGC AAG CAA GTT TAT GCA CCT AAT AAA ATC CAG TTG GAT ATG 435
Thr Leu Gly Lys Gln Val Tyr Ala Pro Asn Lys Ile Gln Leu Asp Met
125 130 135

GTC TCT TGG GGT GTG GGG AGC GAT TTG TTA GCT GAT ATT ATT GAT AAC 483
Val Ser Trp Gly Val Gly Ser Asp Leu Leu Ala Asp Ile Ile Asp Asn
140 145 150 155

GAT AAC GCT TCT TTT GGT ATT TTT GGT GGG GTC GCT ATC GGC GGT AAC 531
Asp Asn Ala Ser Phe Gly Ile Phe Gly Gly Val Ala Ile Gly Gly Asn
160 165 170

ACT TGG AAA AGC TCA GCG GCA AAC TAT TGG AAA GAG CAA ATC ATT GAA 579
Thr Trp Lys Ser Ser Ala Ala Asn Tyr Trp Lys Glu Gln Ile Ile Glu
175 180 185

GCT AAG GGT CCT GAT GTT TGT ACC CCT ACT TAT TGT AAC CCT AAC GCT 627
Ala Lys Gly Pro Asp Val Cys Thr Pro Thr Tyr Cys Asn Pro Asn Ala
190 195 200

CCT TAT AGC ACC AAA ACT TCA ACC GTC GCT TTT CAG GTA TGG TTG AAT 675
Pro Tyr Ser Thr Lys Thr Ser Thr Val Ala Phe Gln Val Trp Leu Asn
205 210 215

TTT GGG GTG AGA GCC AAT ATT TAC AAG CAT AAT GGC GTA GAG TTT GGC 723
Phe Gly Val Arg Ala Asn Ile Tyr Lys His Asn Gly Val Glu Phe Gly
220 225 230 235

GTG AGA GTG CCG CTA CTC ATC AAC AAG TTT TTG AGT GCG GGT CCT AAC 771
Val Arg Val Pro Leu Leu Ile Asn Lys Phe Leu Ser Ala Gly Pro Asn
240 245 250

GCT ACT AAT CTT TAT TAC CAT TTG AAA CGG GAT TAT TCG CTT TAT TTA 819
Ala Thr Asn Leu Tyr Tyr His Leu Lys Arg Asp Tyr Ser Leu Tyr Leu
255 260 265

GGG TAT AAC TAC ACT TTT TAAACCCTTT AAAAGGGTGT CTTTAAGCCC TTTTTAGT 875
Gly Tyr Asn Tyr Thr Phe
270

CCTTATAAAA AGG 888

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

Met Glu Phe Met Lys Lys Phe Val Ala Leu Gly Leu Leu Ser Ala Val
1 5 10 15

Leu Ser Ser Ser Leu Leu Ala Glu Gly Asp Gly Val Tyr Ile Gly Thr
20 25 30

Asn Tyr Gln Leu Gly Gln Ala Arg Leu Asn Ser Asn Ile Tyr Asn Thr
35 40 45

Gly Asp Cys Thr Gly Ser Val Val Gly Cys Pro Pro Gly Leu Thr Ala
50 55 60

Asn Lys His Asn Pro Gly Gly Thr Asn Ile Asn Trp His Ala Lys Tyr
65 70 75 80

Ala Asn Gly Ala Leu Asn Gly Leu Gly Leu Asn Val Gly Tyr Lys Lys
85 90 95

Phe Phe Gln Phe Lys Ser Phe Asp Met Thr Ser Lys Trp Phe Gly Phe
100 105 110

Arg Val Tyr Gly Leu Phe Asp Tyr Gly His Ala Thr Leu Gly Lys Gln
115 120 125

Val Tyr Ala Pro Asn Lys Ile Gln Leu Asp Met Val Ser Trp Gly Val
130 135 140

Gly Ser Asp Leu Leu Ala Asp Ile Ile Asp Asn Asp Asn Ala Ser Phe
145 150 155 160

Gly Ile Phe Gly Gly Val Ala Ile Gly Gly Asn Thr Trp Lys Ser Ser
165 170 175

Ala Ala Asn Tyr Trp Lys Glu Gln Ile Ile Glu Ala Lys Gly Pro Asp
180 185 190

Val Cys Thr Pro Thr Tyr Cys Asn Pro Asn Ala Pro Tyr Ser Thr Lys
195 200 205

Thr Ser Thr Val Ala Phe Gln Val Trp Leu Asn Phe Gly Val Arg Ala
210 215 220

Asn Ile Tyr Lys His Asn Gly Val Glu Phe Gly Val Arg Val Pro Leu
225 230 235 240

Leu Ile Asn Lys Phe Leu Ser Ala Gly Pro Asn Ala Thr Asn Leu Tyr
245 250 255

Tyr His Leu Lys Arg Asp Tyr Ser Leu Tyr Leu Gly Tyr Asn Tyr Thr
260 265 270

Phe



(2) INFORMATION FOR SEQ ID NO:153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 10...279
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 



AAAAGGAGA GTG GCG GTG AAA AAA ATC GTT GTG AGT TGG TGT GTG GCG TTG 51
Val Ala Val Lys Lys Ile Val Val Ser Trp Cys Val Ala Leu
1 5 10

GCT TTT TTA AGC GCG GAT TCA GCA CAA GCC AAT AAA GCG ATC AGT AAT 99
Ala Phe Leu Ser Ala Asp Ser Ala Gln Ala Asn Lys Ala Ile Ser Asn
15 20 25 30

GCG GAT TTG ATT AAA GAG ATA AGG GAT TTA AAA AAA ATC ATC AGC GCG 147
Ala Asp Leu Ile Lys Glu Ile Arg Asp Leu Lys Lys Ile Ile Ser Ala
35 40 45

CAA AAC ACT GAG ATT AAC AAC TTA AGA AAA GTG CAA GAA GTG TTG TCT 195
Gln Asn Thr Glu Ile Asn Asn Leu Arg Lys Val Gln Glu Val Leu Ser
50 55 60

GGG CAA TTA GGG GAC ATG CGT AAG GAT ATA TTA AGC ACT AGA GAT TAT 243
Gly Gln Leu Gly Asp Met Arg Lys Asp Ile Leu Ser Thr Arg Asp Tyr
65 70 75

TGC ATT AGC TTA AGG CCT TAT ATC TAT AAT TGG CGC TAGGGGATAA TCCAAA 295
Cys Ile Ser Leu Arg Pro Tyr Ile Tyr Asn Trp Arg
80 85 90

AAATGAAAGC ATGCG 310

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154:

Val Ala Val Lys Lys Ile Val Val Ser Trp Cys Val Ala Leu Ala Phe
1 5 10 15

Leu Ser Ala Asp Ser Ala Gln Ala Asn Lys Ala Ile Ser Asn Ala Asp
20 25 30

Leu Ile Lys Glu Ile Arg Asp Leu Lys Lys Ile Ile Ser Ala Gln Asn
35 40 45

Thr Glu Ile Asn Asn Leu Arg Lys Val Gln Glu Val Leu Ser Gly Gln
50 55 60

Leu Gly Asp Met Arg Lys Asp Ile Leu Ser Thr Arg Asp Tyr Cys Ile
65 70 75 80

Ser Leu Arg Pro Tyr Ile Tyr Asn Trp Arg
85 90



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...474
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

TGTTAAGATC AGTTT ATG GAA CAA AAT ATT TTC TCC TTA CTC ATT CAA AAA 51
Met Glu Gln Asn Ile Phe Ser Leu Leu Ile Gln Lys
1 5 10

AAG TCT TAT AAA AAG CTT GAA ACC CTT TTG AAA CTC AAA AAG CTT AAG 99
Lys Ser Tyr Lys Lys Leu Glu Thr Leu Leu Lys Leu Lys Lys Leu Lys
15 20 25

GTT TTT ATG CCT TTA AGT TTA CAA GAA AAT TTG CTT TTT ATC TTC ATA 147
Val Phe Met Pro Leu Ser Leu Gln Glu Asn Leu Leu Phe Ile Phe Ile
30 35 40

AAA GAC TCT AAA TTG CTT TTT GCG TTT AAA GAC ATT TGG GCT TCT AAA 195
Lys Asp Ser Lys Leu Leu Phe Ala Phe Lys Asp Ile Trp Ala Ser Lys
45 50 55 60

GAA TTT AAC CAA CGA TTC GCT AAA GAA ATC AGC CAT TTT TTA AAC ACG 243
Glu Phe Asn Gln Arg Phe Ala Lys Glu Ile Ser His Phe Leu Asn Thr
65 70 75

CAA GGG CAT GCT TAT GGG TTT GAC GGG TTG AAT GGG TTA GAA ATT TTA 291
Gln Gly His Ala Tyr Gly Phe Asp Gly Leu Asn Gly Leu Glu Ile Leu
80 85 90

GGT TAT GTG CCT AAA GAC GCG CTA AAA AAA TCC AAT TTT TAT GCC CCC 339
Gly Tyr Val Pro Lys Asp Ala Leu Lys Lys Ser Asn Phe Tyr Ala Pro
95 100 105

ATT AAA AAA CAA GCC CGT TTT TTT CGC CCT AGT GCT TTA GGG TTG TTC 387
Ile Lys Lys Gln Ala Arg Phe Phe Arg Pro Ser Ala Leu Gly Leu Phe
110 115 120

CAT AAC CCC ATT AAA GAC GCT CGT TTG CAT GAA TGT TTT GAA AAA GCG 435
His Asn Pro Ile Lys Asp Ala Arg Leu His Glu Cys Phe Glu Lys Ala
125 130 135 140

CGC GCT TTG ATC CAC TAC CAA CGA AGT TTT TTT GAG GAA TGAATGGCTG AT 486
Arg Ala Leu Ile His Tyr Gln Arg Ser Phe Phe Glu Glu
145 150

TTATTGTCCA GTTTAAAAAA CCTTCCTAAC AGCAGTGGCG TGTATCAATA TTTTGATAAA 546

AAC 549



(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 



Met Glu Gln Asn Ile Phe Ser Leu Leu Ile Gln Lys Lys Ser Tyr Lys
1 5 10 15

Lys Leu Glu Thr Leu Leu Lys Leu Lys Lys Leu Lys Val Phe Met Pro
20 25 30

Leu Ser Leu Gln Glu Asn Leu Leu Phe Ile Phe Ile Lys Asp Ser Lys
35 40 45

Leu Leu Phe Ala Phe Lys Asp Ile Trp Ala Ser Lys Glu Phe Asn Gln
50 55 60

Arg Phe Ala Lys Glu Ile Ser His Phe Leu Asn Thr Gln Gly His Ala
65 70 75 80

Tyr Gly Phe Asp Gly Leu Asn Gly Leu Glu Ile Leu Gly Tyr Val Pro
85 90 95

Lys Asp Ala Leu Lys Lys Ser Asn Phe Tyr Ala Pro Ile Lys Lys Gln
100 105 110

Ala Arg Phe Phe Arg Pro Ser Ala Leu Gly Leu Phe His Asn Pro Ile
115 120 125

Lys Asp Ala Arg Leu His Glu Cys Phe Glu Lys Ala Arg Ala Leu Ile
130 135 140

His Tyr Gln Arg Ser Phe Phe Glu Glu
145 150

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2627 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 18...2582
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:

AAAGACATGT GCAACCG ATG AAA TCT AAA AAA CTT TAT TTG GCT TTA ATC 50
Met Lys Ser Lys Lys Leu Tyr Leu Ala Leu Ile
1 5 10

ATA GGG GTT TTA TTA GCG TTT TTA ACC CTA TCT TCA TGG CTG GGT AAT 98
Ile Gly Val Leu Leu Ala Phe Leu Thr Leu Ser Ser Trp Leu Gly Asn
15 20 25

AGC GGT TTA GTG GGG CGT TTT GGG GTG TGG TTT GCC GCA CTC AAT AAA 146
Ser Gly Leu Val Gly Arg Phe Gly Val Trp Phe Ala Ala Leu Asn Lys
30 35 40

AAA TAT TTT GGG CAT CTT TCA TTC ATT AAT TTA CCC TAT TTA GCA TGG 194
Lys Tyr Phe Gly His Leu Ser Phe Ile Asn Leu Pro Tyr Leu Ala Trp
45 50 55

GTT TTA TTC CTT TTA TAC AAG ACT AAA AAC CCT TTT ACA GAA ATC GTT 242
Val Leu Phe Leu Leu Tyr Lys Thr Lys Asn Pro Phe Thr Glu Ile Val
60 65 70 75

TTA GAA AAA ACT TTA GGG CAT CTA TTA GGC ATT TTA TCT TTG CTC TTT 290
Leu Glu Lys Thr Leu Gly His Leu Leu Gly Ile Leu Ser Leu Leu Phe
80 85 90

TTA CAA TCT AGC CTA TTA AAT CAA GGG GAA ATC GGC AAC AGC GCG CGT 338
Leu Gln Ser Ser Leu Leu Asn Gln Gly Glu Ile Gly Asn Ser Ala Arg
95 100 105

TTG TTT TTA CGC CCT TTT ATA GGG GAT TTT GGG CTT TAT GCG CTG ATA 386
Leu Phe Leu Arg Pro Phe Ile Gly Asp Phe Gly Leu Tyr Ala Leu Ile
110 115 120

ACG CTT ATG GTA GTT ATT TCT TAT TTG ATT CTA TTC AAA CTA CCC CCT 434
Thr Leu Met Val Val Ile Ser Tyr Leu Ile Leu Phe Lys Leu Pro Pro
125 130 135

AAA AGC GTT TTT TAT CCT TAT ATG AAC AAA ACA CAA AAC CTT TTA AAA 482
Lys Ser Val Phe Tyr Pro Tyr Met Asn Lys Thr Gln Asn Leu Leu Lys
140 145 150 155

GAG ATT TAC AAA CAA TGC TTA CAA GCC TTT AGC CCT AAT TTT AGC CCA 530
Glu Ile Tyr Lys Gln Cys Leu Gln Ala Phe Ser Pro Asn Phe Ser Pro
160 165 170

AAA AAA GAG GGT TTT GAA AAC ACC CCA TCA GAT ATT CAA AAA AAA GAA 578
Lys Lys Glu Gly Phe Glu Asn Thr Pro Ser Asp Ile Gln Lys Lys Glu
175 180 185

ACC AAA AAC GAC AAA GAA AAA GAA AAC CGC AAA GAA AAC CCT ATT AAT 626
Thr Lys Asn Asp Lys Glu Lys Glu Asn Arg Lys Glu Asn Pro Ile Asn
190 195 200

GAA AAC CAC AAA ACC CCT AAC GAA GAA CCG TTT TTA GCG ATC CCT ACC 674
Glu Asn His Lys Thr Pro Asn Glu Glu Pro Phe Leu Ala Ile Pro Thr
205 210 215

CCC TAT AAC ACG ACT TTA AAT GAT TCA GAG CCG CAA GAA GGC TTA GTC 722
Pro Tyr Asn Thr Thr Leu Asn Asp Ser Glu Pro Gln Glu Gly Leu Val
220 225 230 235

CAA ATT TCC TCC CAC CCC CCT ACC CAT TAC ACC ATT TAC CCT AAA AGA 770
Gln Ile Ser Ser His Pro Pro Thr His Tyr Thr Ile Tyr Pro Lys Arg
240 245 250

AAC CGA TTT GAT GAT TTG ACT AAC CCC ACT AAC CCC CCT TTA AAA GAA 818
Asn Arg Phe Asp Asp Leu Thr Asn Pro Thr Asn Pro Pro Leu Lys Glu
255 260 265

ATT AAA CAA GAA ACT AAA GAA AGA GAA CCC ACG CCT ACA AAA GAA ACT 866
Ile Lys Gln Glu Thr Lys Glu Arg Glu Pro Thr Pro Thr Lys Glu Thr
270 275 280

CTT ACG CCC ACC ACG CCC AAA CCT ATC ATG CCC ACA CTT GCA CCC ATA 914
Leu Thr Pro Thr Thr Pro Lys Pro Ile Met Pro Thr Leu Ala Pro Ile
285 290 295

ATA GAA AAT GAC AAC AAA ACA GAA AAC CAA AAA ACC CCC AAC CAC CCT 962
Ile Glu Asn Asp Asn Lys Thr Glu Asn Gln Lys Thr Pro Asn His Pro
300 305 310 315

AAA AAA GAA GAA AAC CCA CAA GAA AAC ACG CAA GAA GAA ATG ATA GAA 1010
Lys Lys Glu Glu Asn Pro Gln Glu Asn Thr Gln Glu Glu Met Ile Glu
320 325 330

GGA AGG ATA GAA GAA ATG ATA AAG GAA AAT CTA AAA AAA GAA GAA AAA 1058
Gly Arg Ile Glu Glu Met Ile Lys Glu Asn Leu Lys Lys Glu Glu Lys
335 340 345

GAA GTG CAA AAC GCT CCA AAC TTT AGC CCA GTA ACC CCC ACA AGC GCT 1106
Glu Val Gln Asn Ala Pro Asn Phe Ser Pro Val Thr Pro Thr Ser Ala
350 355 360

AAA AAA CCC GTT ATG GTT AAA GAA TTG AGC GAA AAT AAA GAG ATA TTA 1154
Lys Lys Pro Val Met Val Lys Glu Leu Ser Glu Asn Lys Glu Ile Leu
365 370 375

GAC GGA TTG GAT TAT GGC GAA GTG CAA AAA CCC AAA GAT TAT GAG CTT 1202
Asp Gly Leu Asp Tyr Gly Glu Val Gln Lys Pro Lys Asp Tyr Glu Leu
380 385 390 395

CCC ACC ACG CAA TTA TTG AAT GCG GTT TGT TTG AAA GAC ACT TCT TTA 1250
Pro Thr Thr Gln Leu Leu Asn Ala Val Cys Leu Lys Asp Thr Ser Leu
400 405 410

GAC GAA AAC GAG ATT GAC CAA AAA ATC CAG GAT CTA TTG AGC AAA CTG 1298
Asp Glu Asn Glu Ile Asp Gln Lys Ile Gln Asp Leu Leu Ser Lys Leu
415 420 425

CGC ACC TTT AAA ATT GAT GGC GAT ATT ATC CGC ACT TAT TCA GGC CCT 1346
Arg Thr Phe Lys Ile Asp Gly Asp Ile Ile Arg Thr Tyr Ser Gly Pro
430 435 440

ATT GTA ACC ACT TTT GAA TTC CGC CCA GCC CCT AAC GTT AAG GTG AGT 1394
Ile Val Thr Thr Phe Glu Phe Arg Pro Ala Pro Asn Val Lys Val Ser
445 450 455

CGT ATT TTA GGC TTG AGC GAT GAT TTA GCG ATG ACT TTA TGC GCT GAA 1442
Arg Ile Leu Gly Leu Ser Asp Asp Leu Ala Met Thr Leu Cys Ala Glu
460 465 470 475

TCC ATC CGC ATT CAA GCC CCT ATT AAG GGT AAA GAT GTC GTT GGC ATT 1490
Ser Ile Arg Ile Gln Ala Pro Ile Lys Gly Lys Asp Val Val Gly Ile
480 485 490

GAA ATC CCT AAC AGC CAA AGC CAA ATT ATT TAT TTA AGA GAA ATT CTA 1538
Glu Ile Pro Asn Ser Gln Ser Gln Ile Ile Tyr Leu Arg Glu Ile Leu
495 500 505

GAG AGC GAA TTG TTT CAA AAA TCC AGC TCG CCC TTA ACT CTA GCT TTA 1586
Glu Ser Glu Leu Phe Gln Lys Ser Ser Ser Pro Leu Thr Leu Ala Leu
510 515 520

GGC AAA GAC ATT GTG GGT AAC CCT TTC ATC ACG GAT TTA AAA AAG CTC 1634
Gly Lys Asp Ile Val Gly Asn Pro Phe Ile Thr Asp Leu Lys Lys Leu
525 530 535

CCC CAT TTG CTC ATC GCT GGC ACG ACA GGA AGC GGT AAG AGC GTG GGC 1682
Pro His Leu Leu Ile Ala Gly Thr Thr Gly Ser Gly Lys Ser Val Gly
540 545 550 555

GTG AAT GCG ATG ATT TTA TCC TTA CTT TAT AAA AAC CCT CCC GAT CAA 1730
Val Asn Ala Met Ile Leu Ser Leu Leu Tyr Lys Asn Pro Pro Asp Gln
560 565 570

CTC AAA TTA GTG ATG ATC GAT CCC AAA ATG GTA GAA TTT AGT ATT TAT 1778
Leu Lys Leu Val Met Ile Asp Pro Lys Met Val Glu Phe Ser Ile Tyr
575 580 585

GCG GAT ATC CCT CAT TTG CTC ACG CCC ATT ATC ACC GAC CCT AAA AAA 1826
Ala Asp Ile Pro His Leu Leu Thr Pro Ile Ile Thr Asp Pro Lys Lys
590 595 600

GCT ATT GGG GCT TTG CAA AGC GTG GCT AAA GAA ATG GAA CGC CGG TAT 1874
Ala Ile Gly Ala Leu Gln Ser Val Ala Lys Glu Met Glu Arg Arg Tyr
605 610 615

TCT TTA ATG AGC GAA TAC AAG GTT AAA ACC ATT GAT TCT TAT AAT GAA 1922
Ser Leu Met Ser Glu Tyr Lys Val Lys Thr Ile Asp Ser Tyr Asn Glu
620 625 630 635

CAA GCC CCA AGT AAC GGC GTT GAA GCG TTC CCC TAT TTG ATT GTG GTG 1970
Gln Ala Pro Ser Asn Gly Val Glu Ala Phe Pro Tyr Leu Ile Val Val
640 645 650

ATT GAT GAA TTA GCG GAT TTA ATG ATG ACA GGG GGC AAA GAA GCG GAG 2018
Ile Asp Glu Leu Ala Asp Leu Met Met Thr Gly Gly Lys Glu Ala Glu
655 660 665

TTT CCT ATC GCT AGA ATC GCT CAA ATG GGG CGC GCG AGC GGC TTA CAC 2066
Phe Pro Ile Ala Arg Ile Ala Gln Met Gly Arg Ala Ser Gly Leu His
670 675 680

CTC ATT GTA GCG ACC CAA CGC CCA AGC GTG GAT GTC GTA ACC GGC TTG 2114
Leu Ile Val Ala Thr Gln Arg Pro Ser Val Asp Val Val Thr Gly Leu
685 690 695

ATT AAA ACC AAC TTG CCT TCA AGG GTG AGT TTT AGG GTA GGC ACT AAG 2162
Ile Lys Thr Asn Leu Pro Ser Arg Val Ser Phe Arg Val Gly Thr Lys
700 705 710 715

ATT GAT TCT AAA GTG ATT TTA GAC ACT GAT GGG GCG CAA AGC TTG TTA 2210
Ile Asp Ser Lys Val Ile Leu Asp Thr Asp Gly Ala Gln Ser Leu Leu
720 725 730

GGA AGA GGC GAT ATG CTC TTT ACC CCC CCA GGA GCG AAC GGG TTA GTG 2258
Gly Arg Gly Asp Met Leu Phe Thr Pro Pro Gly Ala Asn Gly Leu Val
735 740 745

CGC TTG CAT GCC CCC TTT GCC ACT GAA GAT GAA ATC AAA AAA ATC GTG 2306
Arg Leu His Ala Pro Phe Ala Thr Glu Asp Glu Ile Lys Lys Ile Val
750 755 760

GAT TTT ATT AAA GCC CAA AAA GAA GTA CAA TAC GAT AAA GAT TTC TTG 2354
Asp Phe Ile Lys Ala Gln Lys Glu Val Gln Tyr Asp Lys Asp Phe Leu
765 770 775

CTA GAA GAA TCA CGC ATG CCT TTA GAC ACC CCT AAT TAT CAA GGC GAT 2402
Leu Glu Glu Ser Arg Met Pro Leu Asp Thr Pro Asn Tyr Gln Gly Asp
780 785 790 795

GAC ATT TTA GAA AGG GCT AAA GCG GTG ATT TTA GAA AAA AAG ATC ACT 2450
Asp Ile Leu Glu Arg Ala Lys Ala Val Ile Leu Glu Lys Lys Ile Thr
800 805 810

TCT ACG AGT TTT TTA CAA CGC CAA TTA AAA ATC GGC TAC AAC CAA GCC 2498
Ser Thr Ser Phe Leu Gln Arg Gln Leu Lys Ile Gly Tyr Asn Gln Ala
815 820 825

GCT ACC ATT ACT GAC GAA TTA GAA GCT CAA GGC TTT TTA TCC CCA AGA 2546
Ala Thr Ile Thr Asp Glu Leu Glu Ala Gln Gly Phe Leu Ser Pro Arg
830 835 840

AAC GCT AAA GGC AAC AGA GAG ATT TTG CAA AAC TTT TAGGCTTTGT TTTCAT 2598
Asn Ala Lys Gly Asn Arg Glu Ile Leu Gln Asn Phe
845 850 855

TGGATATTGG CAAACATTAT TTTTGATTT 2627

(2) INFORMATION FOR SEQ ID NO: 158:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 855 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158:
Met Lys Ser Lys Lys Leu Tyr Leu Ala Leu Ile Ile Gly Val Leu Leu
1 5 10 15

Ala Phe Leu Thr Leu Ser Ser Trp Leu Gly Asn Ser Gly Leu Val Gly
20 25 30

Arg Phe Gly Val Trp Phe Ala Ala Leu Asn Lys Lys Tyr Phe Gly His
35 40 45

Leu Ser Phe Ile Asn Leu Pro Tyr Leu Ala Trp Val Leu Phe Leu Leu
50 55 60

Tyr Lys Thr Lys Asn Pro Phe Thr Glu Ile Val Leu Glu Lys Thr Leu
65 70 75 80

Gly His Leu Leu Gly Ile Leu Ser Leu Leu Phe Leu Gln Ser Ser Leu
85 90 95

Leu Asn Gln Gly Glu Ile Gly Asn Ser Ala Arg Leu Phe Leu Arg Pro
100 105 110

Phe Ile Gly Asp Phe Gly Leu Tyr Ala Leu Ile Thr Leu Met Val Val
115 120 125

Ile Ser Tyr Leu Ile Leu Phe Lys Leu Pro Pro Lys Ser Val Phe Tyr
130 135 140

Pro Tyr Met Asn Lys Thr Gln Asn Leu Leu Lys Glu Ile Tyr Lys Gln
145 150 155 160

Cys Leu Gln Ala Phe Ser Pro Asn Phe Ser Pro Lys Lys Glu Gly Phe
165 170 175

Glu Asn Thr Pro Ser Asp Ile Gln Lys Lys Glu Thr Lys Asn Asp Lys
180 185 190

Glu Lys Glu Asn Arg Lys Glu Asn Pro Ile Asn Glu Asn His Lys Thr
195 200 205

Pro Asn Glu Glu Pro Phe Leu Ala Ile Pro Thr Pro Tyr Asn Thr Thr
210 215 220

Leu Asn Asp Ser Glu Pro Gln Glu Gly Leu Val Gln Ile Ser Ser His
225 230 235 240

Pro Pro Thr His Tyr Thr Ile Tyr Pro Lys Arg Asn Arg Phe Asp Asp
245 250 255

Leu Thr Asn Pro Thr Asn Pro Pro Leu Lys Glu Ile Lys Gln Glu Thr
260 265 270

Lys Glu Arg Glu Pro Thr Pro Thr Lys Glu Thr Leu Thr Pro Thr Thr
275 280 285

Pro Lys Pro Ile Met Pro Thr Leu Ala Pro Ile Ile Glu Asn Asp Asn
290 295 300

Lys Thr Glu Asn Gln Lys Thr Pro Asn His Pro Lys Lys Glu Glu Asn
305 310 315 320

Pro Gln Glu Asn Thr Gln Glu Glu Met Ile Glu Gly Arg Ile Glu Glu
325 330 335

Met Ile Lys Glu Asn Leu Lys Lys Glu Glu Lys Glu Val Gln Asn Ala
340 345 350

Pro Asn Phe Ser Pro Val Thr Pro Thr Ser Ala Lys Lys Pro Val Met
355 360 365

Val Lys Glu Leu Ser Glu Asn Lys Glu Ile Leu Asp Gly Leu Asp Tyr
370 375 380

Gly Glu Val Gln Lys Pro Lys Asp Tyr Glu Leu Pro Thr Thr Gln Leu
385 390 395 400

Leu Asn Ala Val Cys Leu Lys Asp Thr Ser Leu Asp Glu Asn Glu Ile
405 410 415

Asp Gln Lys Ile Gln Asp Leu Leu Ser Lys Leu Arg Thr Phe Lys Ile
420 425 430

Asp Gly Asp Ile Ile Arg Thr Tyr Ser Gly Pro Ile Val Thr Thr Phe
435 440 445

Glu Phe Arg Pro Ala Pro Asn Val Lys Val Ser Arg Ile Leu Gly Leu
450 455 460

Ser Asp Asp Leu Ala Met Thr Leu Cys Ala Glu Ser Ile Arg Ile Gln
465 470 475 480

Ala Pro Ile Lys Gly Lys Asp Val Val Gly Ile Glu Ile Pro Asn Ser
485 490 495

Gln Ser Gln Ile Ile Tyr Leu Arg Glu Ile Leu Glu Ser Glu Leu Phe
500 505 510

Gln Lys Ser Ser Ser Pro Leu Thr Leu Ala Leu Gly Lys Asp Ile Val
515 520 525

Gly Asn Pro Phe Ile Thr Asp Leu Lys Lys Leu Pro His Leu Leu Ile
530 535 540

Ala Gly Thr Thr Gly Ser Gly Lys Ser Val Gly Val Asn Ala Met Ile
545 550 555 560

Leu Ser Leu Leu Tyr Lys Asn Pro Pro Asp Gln Leu Lys Leu Val Met
565 570 575

Ile Asp Pro Lys Met Val Glu Phe Ser Ile Tyr Ala Asp Ile Pro His
580 585 590

Leu Leu Thr Pro Ile Ile Thr Asp Pro Lys Lys Ala Ile Gly Ala Leu
595 600 605

Gln Ser Val Ala Lys Glu Met Glu Arg Arg Tyr Ser Leu Met Ser Glu
610 615 620

Tyr Lys Val Lys Thr Ile Asp Ser Tyr Asn Glu Gln Ala Pro Ser Asn
625 630 635 640

Gly Val Glu Ala Phe Pro Tyr Leu Ile Val Val Ile Asp Glu Leu Ala
645 650 655

Asp Leu Met Met Thr Gly Gly Lys Glu Ala Glu Phe Pro Ile Ala Arg
660 665 670

Ile Ala Gln Met Gly Arg Ala Ser Gly Leu His Leu Ile Val Ala Thr
675 680 685

Gln Arg Pro Ser Val Asp Val Val Thr Gly Leu Ile Lys Thr Asn Leu
690 695 700

Pro Ser Arg Val Ser Phe Arg Val Gly Thr Lys Ile Asp Ser Lys Val
705 710 715 720

Ile Leu Asp Thr Asp Gly Ala Gln Ser Leu Leu Gly Arg Gly Asp Met
725 730 735

Leu Phe Thr Pro Pro Gly Ala Asn Gly Leu Val Arg Leu His Ala Pro
740 745 750

Phe Ala Thr Glu Asp Glu Ile Lys Lys Ile Val Asp Phe Ile Lys Ala
755 760 765

Gln Lys Glu Val Gln Tyr Asp Lys Asp Phe Leu Leu Glu Glu Ser Arg
770 775 780

Met Pro Leu Asp Thr Pro Asn Tyr Gln Gly Asp Asp Ile Leu Glu Arg
785 790 795 800

Ala Lys Ala Val Ile Leu Glu Lys Lys Ile Thr Ser Thr Ser Phe Leu
805 810 815

Gln Arg Gln Leu Lys Ile Gly Tyr Asn Gln Ala Ala Thr Ile Thr Asp
820 825 830

Glu Leu Glu Ala Gln Gly Phe Leu Ser Pro Arg Asn Ala Lys Gly Asn
835 840 845

Arg Glu Ile Leu Gln Asn Phe
850 855



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1986 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 56...1945
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159:

GGTGTCCTTA AACAGCAGGG TGAAAGAGAT TTTAAAAGAA AGCGCTCTGC ATTCT ATG 58
Met
1

CAA GAT AGT TTG CAT TTT AAG GTT AAT GAA GTG CAA GGG GTT TTA GAA 106
Gln Asp Ser Leu His Phe Lys Val Asn Glu Val Gln Gly Val Leu Glu
5 10 15

AAC ACT TAT ACG AGC ATG GGC ATT GTT AAA GAA ATG CTC CCT AAA GAC 154
Asn Thr Tyr Thr Ser Met Gly Ile Val Lys Glu Met Leu Pro Lys Asp
20 25 30

ACC AAA AGA GAA ATC AAA ATC GGC TTG TTA AAA AAC TTC ATT TTA GCC 202
Thr Lys Arg Glu Ile Lys Ile Gly Leu Leu Lys Asn Phe Ile Leu Ala
35 40 45

AAT TCG CAT GTC GCT GGG GTG AGC ATG TTT TTT AAA GGC AGA GAA GAT 250
Asn Ser His Val Ala Gly Val Ser Met Phe Phe Lys Gly Arg Glu Asp
50 55 60 65

TTA AGA TTA ACG CTT TTA AGG GAT AAC AAT ACG ATT AAG CTA GTG GAA 298
Leu Arg Leu Thr Leu Leu Arg Asp Asn Asn Thr Ile Lys Leu Val Glu
70 75 80

AAT CCG TCA TTA GAA AAT AGC CCT TTA GCG CAA AAA GCG ATG AAA AAT 346
Asn Pro Ser Leu Glu Asn Ser Pro Leu Ala Gln Lys Ala Met Lys Asn
85 90 95

AAA GAA ATT TCT AAA AGT TTG GGT TAT TAT AGG AAA ATG CCT AAT GGG 394
Lys Glu Ile Ser Lys Ser Leu Gly Tyr Tyr Arg Lys Met Pro Asn Gly
100 105 110

GCG GAA GTT TAT GGG GTG GAT ATT CTT TTA CCT TTA TTG AAT GAG AAC 442
Ala Glu Val Tyr Gly Val Asp Ile Leu Leu Pro Leu Leu Asn Glu Asn
115 120 125

GCT CAA GAG GTT GTA GGG GCT TTG ATG ATT TTT ATT TCC ATT GAC AGC 490
Ala Gln Glu Val Val Gly Ala Leu Met Ile Phe Ile Ser Ile Asp Ser
130 135 140 145

TTC AGC AAT GAA ATC ACT AAA AAC AGG AGC GAT TTA TTT TTA ATT GGC 538
Phe Ser Asn Glu Ile Thr Lys Asn Arg Ser Asp Leu Phe Leu Ile Gly
150 155 160

ACT AAA GGT AAA GTG CTT TTG AGC GCG AAT AAG AGT TTG CAA GAC AAA 586
Thr Lys Gly Lys Val Leu Leu Ser Ala Asn Lys Ser Leu Gln Asp Lys
165 170 175

CCT ATC GCA GAA ATT TAT AAG AGC GTG CCT AAA GCC ACC AAC GAA GTG 634
Pro Ile Ala Glu Ile Tyr Lys Ser Val Pro Lys Ala Thr Asn Glu Val
180 185 190

ATG GCT ATT TTA GAA AAC GGC TCT AAA GCG ACT TTA GAA TAC TTA GAT 682
Met Ala Ile Leu Glu Asn Gly Ser Lys Ala Thr Leu Glu Tyr Leu Asp
195 200 205

CCC TTT AGC CAT AAG GAA AAT TTT TTA GCC GTT GAA ACC TTT AAA ATG 730
Pro Phe Ser His Lys Glu Asn Phe Leu Ala Val Glu Thr Phe Lys Met
210 215 220 225

CTA GGC AAA ACA GAA AGT AAA GAC AAT CTT AAT TGG ATG ATC GCT TTA 778
Leu Gly Lys Thr Glu Ser Lys Asp Asn Leu Asn Trp Met Ile Ala Leu
230 235 240

ATC ATT GAA AAA GAC AAG GTC TAT GAG CAA GTA GGC TCG GTG CGT TTT 826
Ile Ile Glu Lys Asp Lys Val Tyr Glu Gln Val Gly Ser Val Arg Phe
245 250 255

GTG GTG ATC ATA GCG AGC GCA ATC ATG GTG TTA GCC TTG ATT ATA GCG 874
Val Val Ile Ile Ala Ser Ala Ile Met Val Leu Ala Leu Ile Ile Ala
260 265 270

ATC ACT CTC TTA ATG CGA GCG ATC GTG AGC AGT CGT TTG GAA GCC GTT 922
Ile Thr Leu Leu Met Arg Ala Ile Val Ser Ser Arg Leu Glu Ala Val
275 280 285

TCT AGC ACC TTG TCT CAT TTC TTT AAA TTA TTG AAC AAT CAA GCC AAT 970
Ser Ser Thr Leu Ser His Phe Phe Lys Leu Leu Asn Asn Gln Ala Asn
290 295 300 305

TCT AGC GGT ATT AAA TTG ATT GAA GCG AAA TCC AAT GAC GAG TTA GGC 1018
Ser Ser Gly Ile Lys Leu Ile Glu Ala Lys Ser Asn Asp Glu Leu Gly
310 315 320

CGC ATG CAA ACA GCG ATC AAT AAA AAT ATC TTG CAA ACC CAA AAA ATC 1066
Arg Met Gln Thr Ala Ile Asn Lys Asn Ile Leu Gln Thr Gln Lys Ile
325 330 335

ATG CAA GAA GAC AGG CAA GCC GTC CAA GAC ACC ATT AAA GTG GTT TCA 1114
Met Gln Glu Asp Arg Gln Ala Val Gln Asp Thr Ile Lys Val Val Ser
340 345 350

GAT GTG AAA GCA GGG AAT TTT GCG GTG CGC ATC ACG GCT GAG CCC GCA 1162
Asp Val Lys Ala Gly Asn Phe Ala Val Arg Ile Thr Ala Glu Pro Ala
355 360 365

AGC CCT GAT TTG AAA GAA TTG AGG GAC GCG CTA AAT GGG ATC ATG GAT 1210
Ser Pro Asp Leu Lys Glu Leu Arg Asp Ala Leu Asn Gly Ile Met Asp
370 375 380 385

TAT TTG CAA GAA AGC GTA GGG ACT CAC ATG CCA AGC ATT TTC AAA ATC 1258
Tyr Leu Gln Glu Ser Val Gly Thr His Met Pro Ser Ile Phe Lys Ile
390 395 400

TTT GAA AGC TAT TCT GGT TTG GAT TTT AGA GGC CGG ATC CAA AAC GCT 1306
Phe Glu Ser Tyr Ser Gly Leu Asp Phe Arg Gly Arg Ile Gln Asn Ala
405 410 415

TCG GGT AGG GTG GAA CTG GTT ACT AAC GCT TTA GGG CAA GAA ATC CAA 1354
Ser Gly Arg Val Glu Leu Val Thr Asn Ala Leu Gly Gln Glu Ile Gln
420 425 430

AAA ATG CTA GAA ACT TCG TCT AAT TTT GCC AAA GAT TTA GCG AAC GAT 1402
Lys Met Leu Glu Thr Ser Ser Asn Phe Ala Lys Asp Leu Ala Asn Asp
435 440 445

AGC GCG AAT TTA AAA GAG TGC GTG CAA AAT TTA GAA AAA GCT TCA AAC 1450
Ser Ala Asn Leu Lys Glu Cys Val Gln Asn Leu Glu Lys Ala Ser Asn
450 455 460 465

TCC CAA CAC AAA AGC TTG ATG GAA ACT TCC AAA ACG ATA GAA AAT ATC 1498
Ser Gln His Lys Ser Leu Met Glu Thr Ser Lys Thr Ile Glu Asn Ile
470 475 480

ACC ACT TCC ATT CAA GGC GTG AGC TCT CAA AGT GAA GCC ATG ATT GAA 1546
Thr Thr Ser Ile Gln Gly Val Ser Ser Gln Ser Glu Ala Met Ile Glu
485 490 495

CAA GGG CAA GAC ATT AAA AGC ATT GTA GAA ATC ATT AGA GAT ATT GCT 1594
Gln Gly Gln Asp Ile Lys Ser Ile Val Glu Ile Ile Arg Asp Ile Ala
500 505 510

GAT CAA ACC AAT CTT TTA GCC TTA AAC GCC GCT ATT GAA GCC GCA AGG 1642
Asp Gln Thr Asn Leu Leu Ala Leu Asn Ala Ala Ile Glu Ala Ala Arg
515 520 525


GCC GGC GAG CAT GGC AGA GGC TTT GCG GTG GTG GCT GAT GAG GTA AGA 1690
Ala Gly Glu His Gly Arg Gly Phe Ala Val Val Ala Asp Glu Val Arg
530 535 540 545

AAG CTC GCT GAA AGG ACG CAA AAA TCG CTC AGC GAG ATT GAA GCC AAT 1738
Lys Leu Ala Glu Arg Thr Gln Lys Ser Leu Ser Glu Ile Glu Ala Asn
550 555 560

ATC AAT ATT TTA GTG CAA AGC ATT TCA GAC ACG AGC GAA AGC ATT AAA 1786
Ile Asn Ile Leu Val Gln Ser Ile Ser Asp Thr Ser Glu Ser Ile Lys
565 570 575

AAC CAG GTT AAA GAA GTG GAA GAA ATC AAC GCT TCT ATT GAA GCC TTA 1834
Asn Gln Val Lys Glu Val Glu Glu Ile Asn Ala Ser Ile Glu Ala Leu
580 585 590

AGA TCG GTT ACT GAG GGC AAT CTA AAA ATC GCT AGC GAT TCT TTA GAA 1882
Arg Ser Val Thr Glu Gly Asn Leu Lys Ile Ala Ser Asp Ser Leu Glu
595 600 605

ATC AGT CAA GAA ATT GAC AAA GTT TCT AAC GAT ATT TTA GAA GAT GTG 1930
Ile Ser Gln Glu Ile Asp Lys Val Ser Asn Asp Ile Leu Glu Asp Val
610 615 620 625

AAT AAA AAG CAG TTT TAATGCTCAT TCATATTTGC TGCTCAGTGG ATAACCTCTA T 1986
Asn Lys Lys Gln Phe
630



(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

Met Gln Asp Ser Leu His Phe Lys Val Asn Glu Val Gln Gly Val Leu
1 5 10 15

Glu Asn Thr Tyr Thr Ser Met Gly Ile Val Lys Glu Met Leu Pro Lys
20 25 30

Asp Thr Lys Arg Glu Ile Lys Ile Gly Leu Leu Lys Asn Phe Ile Leu
35 40 45

Ala Asn Ser His Val Ala Gly Val Ser Met Phe Phe Lys Gly Arg Glu
50 55 60

Asp Leu Arg Leu Thr Leu Leu Arg Asp Asn Asn Thr Ile Lys Leu Val
65 70 75 80

Glu Asn Pro Ser Leu Glu Asn Ser Pro Leu Ala Gln Lys Ala Met Lys
85 90 95

Asn Lys Glu Ile Ser Lys Ser Leu Gly Tyr Tyr Arg Lys Met Pro Asn
100 105 110

Gly Ala Glu Val Tyr Gly Val Asp Ile Leu Leu Pro Leu Leu Asn Glu
115 120 125

Asn Ala Gln Glu Val Val Gly Ala Leu Met Ile Phe Ile Ser Ile Asp
130 135 140

Ser Phe Ser Asn Glu Ile Thr Lys Asn Arg Ser Asp Leu Phe Leu Ile
145 150 155 160

Gly Thr Lys Gly Lys Val Leu Leu Ser Ala Asn Lys Ser Leu Gln Asp
165 170 175

Lys Pro Ile Ala Glu Ile Tyr Lys Ser Val Pro Lys Ala Thr Asn Glu
180 185 190

Val Met Ala Ile Leu Glu Asn Gly Ser Lys Ala Thr Leu Glu Tyr Leu
195 200 205

Asp Pro Phe Ser His Lys Glu Asn Phe Leu Ala Val Glu Thr Phe Lys
210 215 220

Met Leu Gly Lys Thr Glu Ser Lys Asp Asn Leu Asn Trp Met Ile Ala
225 230 235 240

Leu Ile Ile Glu Lys Asp Lys Val Tyr Glu Gln Val Gly Ser Val Arg
245 250 255

Phe Val Val Ile Ile Ala Ser Ala Ile Met Val Leu Ala Leu Ile Ile
260 265 270

Ala Ile Thr Leu Leu Met Arg Ala Ile Val Ser Ser Arg Leu Glu Ala
275 280 285

Val Ser Ser Thr Leu Ser His Phe Phe Lys Leu Leu Asn Asn Gln Ala
290 295 300

Asn Ser Ser Gly Ile Lys Leu Ile Glu Ala Lys Ser Asn Asp Glu Leu
305 310 315 320

Gly Arg Met Gln Thr Ala Ile Asn Lys Asn Ile Leu Gln Thr Gln Lys
325 330 335

Ile Met Gln Glu Asp Arg Gln Ala Val Gln Asp Thr Ile Lys Val Val
340 345 350

Ser Asp Val Lys Ala Gly Asn Phe Ala Val Arg Ile Thr Ala Glu Pro
355 360 365

Ala Ser Pro Asp Leu Lys Glu Leu Arg Asp Ala Leu Asn Gly Ile Met
370 375 380

Asp Tyr Leu Gln Glu Ser Val Gly Thr His Met Pro Ser Ile Phe Lys
385 390 395 400

Ile Phe Glu Ser Tyr Ser Gly Leu Asp Phe Arg Gly Arg Ile Gln Asn
405 410 415

Ala Ser Gly Arg Val Glu Leu Val Thr Asn Ala Leu Gly Gln Glu Ile
420 425 430

Gln Lys Met Leu Glu Thr Ser Ser Asn Phe Ala Lys Asp Leu Ala Asn
435 440 445

Asp Ser Ala Asn Leu Lys Glu Cys Val Gln Asn Leu Glu Lys Ala Ser
450 455 460

Asn Ser Gln His Lys Ser Leu Met Glu Thr Ser Lys Thr Ile Glu Asn
465 470 475 480

Ile Thr Thr Ser Ile Gln Gly Val Ser Ser Gln Ser Glu Ala Met Ile
485 490 495

Glu Gln Gly Gln Asp Ile Lys Ser Ile Val Glu Ile Ile Arg Asp Ile
500 505 510

Ala Asp Gln Thr Asn Leu Leu Ala Leu Asn Ala Ala Ile Glu Ala Ala
515 520 525

Arg Ala Gly Glu His Gly Arg Gly Phe Ala Val Val Ala Asp Glu Val
530 535 540

Arg Lys Leu Ala Glu Arg Thr Gln Lys Ser Leu Ser Glu Ile Glu Ala
545 550 555 560

Asn Ile Asn Ile Leu Val Gln Ser Ile Ser Asp Thr Ser Glu Ser Ile
565 570 575

Lys Asn Gln Val Lys Glu Val Glu Glu Ile Asn Ala Ser Ile Glu Ala
580 585 590

Leu Arg Ser Val Thr Glu Gly Asn Leu Lys Ile Ala Ser Asp Ser Leu
595 600 605

Glu Ile Ser Gln Glu Ile Asp Lys Val Ser Asn Asp Ile Leu Glu Asp
610 615 620

Val Asn Lys Lys Gln Phe
625 630



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1758 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 8...1702
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

GAGATAA ATG ATG TTT TCT TCA ATG TTT GCT TCG TTG GGG ACT CGT ATC 49
Met Met Phe Ser Ser Met Phe Ala Ser Leu Gly Thr Arg Ile
1 5 10

ATG CTG GTC GTG TTA GCC GCT CTT TTA GGT TTA GGG GGG CTT TTT ATT 97
Met Leu Val Val Leu Ala Ala Leu Leu Gly Leu Gly Gly Leu Phe Ile
15 20 25 30

GGT TTT GTA AAG GTT ATG CAA AAA GAT GTG TTA GCG CAA CTC ATG GAG 145
Gly Phe Val Lys Val Met Gln Lys Asp Val Leu Ala Gln Leu Met Glu
35 40 45

CAT TTA GAA ACC GGG CAA TAC AAA AAG CGT GAA AAA ACG CTC GCT TAC 193
His Leu Glu Thr Gly Gln Tyr Lys Lys Arg Glu Lys Thr Leu Ala Tyr
50 55 60

ATG ACA AAA ATT ATT GAA CAG GGC ATT CAT GAG TAT TAC AAA AAT TTT 241
Met Thr Lys Ile Ile Glu Gln Gly Ile His Glu Tyr Tyr Lys Asn Phe
65 70 75

GAC AAT GCT ACT GCA AGA AAA ATG GCG TTA GAT TAT TTC AAA CGC ATC 289
Asp Asn Ala Thr Ala Arg Lys Met Ala Leu Asp Tyr Phe Lys Arg Ile
80 85 90

AAC GAC GAT AAG GGC ATG ATT TAT ATG GTG GTG GTG GAT AAA AAC GGG 337
Asn Asp Asp Lys Gly Met Ile Tyr Met Val Val Val Asp Lys Asn Gly
95 100 105 110

GTG GTA TTG TTT GAT CCG GTC AAT CCT AAA ACC GTA GNC CAA TCA GGG 385
Val Val Leu Phe Asp Pro Val Asn Pro Lys Thr Val Xaa Gln Ser Gly
115 120 125

CTT GAC GCT CAG AGC GTT GAT GGG GTG TAT TAT GTT AGG GGG TAT TTG 433
Leu Asp Ala Gln Ser Val Asp Gly Val Tyr Tyr Val Arg Gly Tyr Leu
130 135 140

GAG GCG GCC AAA AAA GGG GGA GGC TAC ACT TAT TAT AAA ATG CCT AAA 481
Glu Ala Ala Lys Lys Gly Gly Gly Tyr Thr Tyr Tyr Lys Met Pro Lys
145 150 155

TAC GAT GGA GGC GTA CCG GAG AAA AAA TTC GCC TAC TCG CAT TAT GAT 529
Tyr Asp Gly Gly Val Pro Glu Lys Lys Phe Ala Tyr Ser His Tyr Asp
160 165 170

GAA GTT TCT CAA ATG GTG ATC GCA ACG ACT TCC TAT TAC ACT GAC ATT 577
Glu Val Ser Gln Met Val Ile Ala Thr Thr Ser Tyr Tyr Thr Asp Ile
175 180 185 190

AAC ACA GAA AAT AAA GCG ATC AAA GAA GGC GTG AAT AAG GTT TTT GAT 625
Asn Thr Glu Asn Lys Ala Ile Lys Glu Gly Val Asn Lys Val Phe Asp
195 200 205

GAA AAC ACC ACG AAA TTA TTC CTT TGG ATA CTG ACA GCG ACG ATA GCG 673
Glu Asn Thr Thr Lys Leu Phe Leu Trp Ile Leu Thr Ala Thr Ile Ala
210 215 220

CTA GTG GTT TTG ACG CTC ATA TAC GCT AAA TTA AGG ATC GTG AAA CGC 721
Leu Val Val Leu Thr Leu Ile Tyr Ala Lys Leu Arg Ile Val Lys Arg
225 230 235

ATT GAT GAA CTG GTC CTT AAA ATC AAC GCT TTT AGC CGT GGG GAT AAG 769
Ile Asp Glu Leu Val Leu Lys Ile Asn Ala Phe Ser Arg Gly Asp Lys
240 245 250

GAT TTG AGA GCC AAA ATT GAT GTG GGT GAT CGC AAC GAT GAA ATC TCG 817
Asp Leu Arg Ala Lys Ile Asp Val Gly Asp Arg Asn Asp Glu Ile Ser
255 260 265 270

CAA GTG GGC CGT GGG ATC AAT TTG TTT GTG GAA AAC GCC CGC TTG ATT 865
Gln Val Gly Arg Gly Ile Asn Leu Phe Val Glu Asn Ala Arg Leu Ile
275 280 285

ATG GAA GAG ATT AAA GGG ATT TCC ACC CTC AAT AAA ACT TCA ATG GAT 913
Met Glu Glu Ile Lys Gly Ile Ser Thr Leu Asn Lys Thr Ser Met Asp
290 295 300

AAA TTA GTC CAA ATC ACG CAA GAA ACC CAA AAG AGC ATG AAA GAT TCC 961
Lys Leu Val Gln Ile Thr Gln Glu Thr Gln Lys Ser Met Lys Asp Ser
305 310 315


TCA ACC ACC CTA AAT Ttu GTG AAA AAT AAA GCC ACT GAT ATA GCG AGC 1009
Ser Thr Thr Leu Asn Ser Val Lys Asn Lys Ala Thr Asp Ile Ala Ser
320 325 330

ATG ATG AAT GCT TCC ATA GAG CAA TCT CAA GGG TTA AGG AAG CGT TTG 1057
Met Met Asn Ala Ser Ile Glu Gln Ser Gln Gly Leu Arg Lys Arg Leu
335 340 345 350

ATT GAA ACG CAA GGG CTG GTC AAA GAG AGC AAG GAT GCG ATC GGG GAT 1105
Ile Glu Thr Gln Gly Leu Val Lys Glu Ser Lys Asp Ala Ile Gly Asp
355 360 365

TTA TTT TCT CAA ATC ACA GAG AGC GCG CAC ACT GAA GAG GAA CTC TCT 1153
Leu Phe Ser Gln Ile Thr Glu Ser Ala His Thr Glu Glu Glu Leu Ser
370 375 380

AGC AAA GTG GAG CAG CTA AGC CGT AAC GCT GAT GAT GTC AAA TCC ATT 1201
Ser Lys Val Glu Gln Leu Ser Arg Asn Ala Asp Asp Val Lys Ser Ile
385 390 395

CTG GAT ATT ATC AAT GAT ATT GCC GAT CAA ACG AAT TTA TTA GCC CTA 1249
Leu Asp Ile Ile Asn Asp Ile Ala Asp Gln Thr Asn Leu Leu Ala Leu
400 405 410

AAC GCT GCT ATT GAA GCC GCA AGG GCT GGC GAG CAT GGC AGA GGC TTT 1297
Asn Ala Ala Ile Glu Ala Ala Arg Ala Gly Glu His Gly Arg Gly Phe
415 420 425 430

GCG GTG GTG GCT GAT GAA GTT AGG AAT TTA GCC GGG CGC ACT CAA AAG 1345
Ala Val Val Ala Asp Glu Val Arg Asn Leu Ala Gly Arg Thr Gln Lys
435 440 445

TCT TTA GCC GAA ATC AAT TCC ACT ATC ATG GTG ATT GTC CAA GAA ATC 1393
Ser Leu Ala Glu Ile Asn Ser Thr Ile Met Val Ile Val Gln Glu Ile
450 455 460

AAT GCC GTG AGT TCG CAA ATG AAT CTC AAT TCG CAA AAA ATG GAG CGT 1441
Asn Ala Val Ser Ser Gln Met Asn Leu Asn Ser Gln Lys Met Glu Arg
465 470 475

TTG AGC GAT ATG AGT AAA AGC GTG CAA GAA ACT TAC GAA AAA ATG AGT 1489
Leu Ser Asp Met Ser Lys Ser Val Gln Glu Thr Tyr Glu Lys Met Ser
480 485 490

TCT AAT TTA AGC TCA GTC GTG TCA GAC AGC AAT CAA AGC ATG GAC GAT 1537
Ser Asn Leu Ser Ser Val Val Ser Asp Ser Asn Gln Ser Met Asp Asp
495 500 505 510

TAC GCC AAA TCC GGA CAC CAA ATT GAA GTT ATG GTA AGC GAT TTT GCA 1585
Tyr Ala Lys Ser Gly His Gln Ile Glu Val Met Val Ser Asp Phe Ala
515 520 525

GAG GTG GAA AAA GTG GCT TCT AAG ACT TTA GCG GAT TCT TCA GAT ATT 1633
Glu Val Glu Lys Val Ala Ser Lys Thr Leu Ala Asp Ser Ser Asp Ile
530 535 540

TTA AAC ATC GCT ACG CAT GTG AGT GGA ACG ACC ATG AAT TTA GAG AAA 1681
Leu Asn Ile Ala Thr His Val Ser Gly Thr Thr Met Asn Leu Asp Lys
545 550 555

CAA GTG AAT TTG TTT AAA ACT TAATCAGGGG GAGTTTATTA AAAAAGGGTT GGAT 1736
Gln Val Asn Leu Phe Lys Thr
560 565

TGTTAAAAGT TTCTGTGATC AC 1758

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:162: 

Met Met Phe Ser Ser Met Phe Ala Ser Leu Gly Thr Arg Ile Met Leu
1 5 10 15

Val Val Leu Ala Ala Leu Leu Gly Leu Gly Gly Leu Phe Ile Gly Phe
20 25 30

Val Lys Val Met Gln Lys Asp Val Leu Ala Gln Leu Met Glu His Leu
35 40 45

Glu Thr Gly Gln Tyr Lys Lys Arg Glu Lys Thr Leu Ala Tyr Met Thr
50 55 60

Lys Ile Ile Glu Gln Gly Ile His Glu Tyr Tyr Lys Asn Phe Asp Asn
65 70 75 80

Ala Thr Ala Arg Lys Met Ala Leu Asp Tyr Phe Lys Arg Ile Asn Asp
85 90 95

Asp Lys Gly Met Ile Tyr Met Val Val Val Asp Lys Asn Gly Val Val
100 105 110

Leu Phe Asp Pro Val Asn Pro Lys Thr Val Xaa Gln Ser Gly Leu Asp
115 120 125

Ala Gln Ser Val Asp Gly Val Tyr Tyr Val Arg Gly Tyr Leu Glu Ala
130 135 140

Ala Lys Lys Gly Gly Gly Tyr Thr Tyr Tyr Lys Met Pro Lys Tyr Asp
145 150 155 160

Gly Gly Val Pro Glu Lys Lys Phe Ala Tyr Ser His Tyr Asp Glu Val
165 170 175

Ser Gln Met Val Ile Ala Thr Thr Ser Tyr Tyr Thr Asp Ile Asn Thr
180 185 190

Glu Asn Lys Ala Ile Lys Glu Gly Val Asn Lys Val Phe Asp Glu Asn
195 200 205

Thr Thr Lys Leu Phe Leu Trp Ile Leu Thr Ala Thr Ile Ala Leu Val
210 215 220

Val Leu Thr Leu Ile Tyr Ala Lys Leu Arg Ile Val Lys Arg Ile Asp
225 230 235 240

Glu Leu Val Leu Lys Ile Asn Ala Phe Ser Arg Gly Asp Lys Asp Leu
245 250 255

Arg Ala Lys Ile Asp Val Gly Asp Arg Asn Asp Glu Ile Ser Gln Val
260 265 270

Gly Arg Gly Ile Asn Leu Phe Val Glu Asn Ala Arg Leu Ile Met Glu
275 280 285

Glu Ile Lys Gly Ile Ser Thr Leu Asn Lys Thr Ser Met Asp Lys Leu
290 295 300

Val Gln Ile Thr Gln Glu Thr Gln Lys Ser Met Lys Asp Ser Ser Thr
305 310 315 320

Thr Leu Asn Ser Val Lys Asn Lys Ala Thr Asp Ile Ala Ser Met Met
325 330 335

Asn Ala Ser Ile Glu Gln Ser Gln Gly Leu Arg Lys Arg Leu Ile Glu
340 345 350

Thr Gln Gly Leu Val Lys Glu Ser Lys Asp Ala Ile Gly Asp Leu Phe
355 360 365

Ser Gln Ile Thr Glu Ser Ala His Thr Glu Glu Glu Leu Ser Ser Lys
370 375 380

Val Glu Gln Leu Ser Arg Asn Ala Asp Asp Val Lys Ser Ile Leu Asp
385 390 395 400

Ile Ile Asn Asp Ile Ala Asp Gln Thr Asn Leu Leu Ala Leu Asn Ala
405 410 415

Ala Ile Glu Ala Ala Arg Ala Gly Glu His Gly Arg Gly Phe Ala Val
420 425 430

Val Ala Asp Glu Val Arg Asn Leu Ala Gly Arg Thr Gln Lys Ser Leu
435 440 445

Ala Glu Ile Asn Ser Thr Ile Met Val Ile Val Gln Glu Ile Asn Ala
450 455 460

Val Ser Ser Gln Met Asn Leu Asn Ser Gln Lys Met Glu Arg Leu Ser
465 470 475 480

Asp Met Ser Lys Ser Val Gln Glu Thr Tyr Glu Lys Met Ser Ser Asn
485 490 495

Leu Ser Ser Val Val Ser Asp Ser Asn Gln Ser Met Asp Asp Tyr Ala
500 505 510

Lys Ser Gly His Gln Ile Glu Val Met Val Ser Asp Phe Ala Glu Val
515 520 525

Glu Lys Val Ala Ser Lys Thr Leu Ala Asp Ser Ser Asp Ile Leu Asn
530 535 540

Ile Ala Thr His Val Ser Gly Thr Thr Met Asn Leu Asp Lys Gln Val
545 550 555 560

Asn Leu Phe Lys Thr
565



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...660
(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:

TATAAGGTTG CTCTC ATG AAA AAA CCC TAT AGG AAG ATT TCT GAT TAT GCG 51
Met Lys Lys Pro Tyr Arg Lys Ile Ser Asp Tyr Ala
1 5 10

ATC GTG GGT GGT TTG AGC GCG TTA GTG ATG GTG AGC ATT GTG GGG TGT 99
Ile Val Gly Gly Leu Ser Ala Leu Val Met Val Ser Ile Val Gly Cys
15 20 25

AAG AGC AAT GCT GAT GAC AAA CCA AAA GAG CAA AGC TCT TTA AGT CAA 147
Lys Ser Asn Ala Asp Asp Lys Pro Lys Glu Gln Ser Ser Leu Ser Gln
30 35 40

AGC GTT CAA AAA GGC GCG TTT GTG ATT TTA GAA GAG CAA AAG GAT AAA 195
Ser Val Gln Lys Gly Ala Phe Val Ile Leu Glu Glu Gln Lys Asp Lys
45 50 55 60

TCT TAC AAG GTT GTT GAA GAA TAC CCC AGC TCA AGA ACC CAC ATT ATA 243
Ser Tyr Lys Val Val Glu Glu Tyr Pro Ser Ser Arg Thr His Ile Ile
65 70 75

GTG CGC GAT TTG CAA GGC AAT GAA CGC GTG TTA AGC AAT GAA GAG ATT 291
Val Arg Asp Leu Gln Gly Asn Glu Arg Val Leu Ser Asn Glu Glu Ile
80 85 90

CAA AAG CTC ATC AAA GAA GAA GAA GCT AAA ATT GAT AAC GGC ACG AGC 339
Gln Lys Leu Ile Lys Glu Glu Glu Ala Lys Ile Asp Asn Gly Thr Ser
95 100 105

AAG CTT GTC CAG CCT AAT AAT GGA GGG AGT AAT GAA GGC TCA GGC TTT 387
Lys Leu Val Gln Pro Asn Asn Gly Gly Ser Asn Glu Gly Ser Gly Phe
110 115 120

GGC TTG GGG AGC GCG ATT TTA GGG AGC GCG GCG GGG GCG ATT TTA GGG 435
Gly Leu Gly Ser Ala Ile Leu Gly Ser Ala Ala Gly Ala Ile Leu Gly
125 130 135 140

AGT TAT ATT GGT AAT AAG CTT TTC AAT AAC CCT AAT TAC CAG CAA AAC 483
Ser Tyr Ile Gly Asn Lys Leu Phe Asn Asn Pro Asn Tyr Gln Gln Asn
145 150 155

GCC CAA CGG ACC TAC AAA TCC CCA CAA GCT TAC CAA CGC TCT CAA AAT 531
Ala Gln Arg Thr Tyr Lys Ser Pro Gln Ala Tyr Gln Arg Ser Gln Asn
160 165 170

TCC TTT TCT AAA AGT GCG CCC AGT GCT TCA AGC ATG GGC GGA GCG AGT 579
Ser Phe Ser Lys Ser Ala Pro Ser Ala Ser Ser Met Gly Gly Ala Ser
175 180 185

AAG GGA CAG AGC GGG TTT TTT GGC TCT AGT AGG CCT ACT AGT TCA CCG 627
Lys Gly Gln Ser Gly Phe Phe Gly Ser Ser Arg Pro Thr Ser Ser Pro
190 195 200

GCG GTA AGC TCT UUu ACA AGG GGC TTT AAC TCA TAATTTAATT GATTCAAGGC 680
Ala Val Ser Ser Gly Thr Arg Gly Phe Asn Ser
205 210 215

TAAAAA 686

(2) INFORMATION FOR SEQ ID NO: 164:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 215 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

Met Lys Lys Pro Tyr Arg Lys Ile Ser Asp Tyr Ala Ile Val Gly Gly
1 5 10 15

Leu Ser Ala Leu Val Met Val Ser Ile Val Gly Cys Lys Ser Asn Ala
20 25 30

Asp Asp Lys Pro Lys Glu Gln Ser Ser Leu Ser Gln Ser Val Gln Lys
35 40 45

Gly Ala Phe Val Ile Leu Glu Glu Gln Lys Asp Lys Ser Tyr Lys Val
50 55 60

Val Glu Glu Tyr Pro Ser Ser Arg Thr His Ile Ile Val Arg Asp Leu
65 70 75 80

Gln Gly Asn Glu Arg Val Leu Ser Asn Glu Glu Ile Gln Lys Leu Ile
85 90 95

Lys Glu Glu Glu Ala Lys Ile Asp Asn Gly Thr Ser Lys Leu Val Gln
100 105 110

Pro Asn Asn Gly Gly Ser Asn Glu Gly Ser Gly Phe Gly Leu Gly Ser
115 120 125

Ala Ile Leu Gly Ser Ala Ala Gly Ala Ile Leu Gly Ser Tyr Ile Gly
130 135 140

Asn Lys Leu Phe Asn Asn Pro Asn Tyr Gln Gln Asn Ala Gln Arg Thr
145 150 155 160

Tyr Lys Ser Pro Gln Ala Tyr Gln Arg Ser Gln Asn Ser Phe Ser Lys
165 170 175

Ser Ala Pro Ser Ala Ser Ser Met Gly Gly Ala Ser Lys Gly Gln Ser
180 185 190

Gly Phe Phe Gly Ser Ser Arg Pro Thr Ser Ser Pro Ala Val Ser Ser
195 200 205

Gly Thr Arg Gly Phe Asn Ser
210 215



(2) INFORMATION FOR SEQ ID NO: 165:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8748 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 16...8694
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

AGAGGGTAGC ATTTA ATG AAA AAG TTT AAA AAG AAA CCA AAA AGT ATC AAA 51
Met Lys Lys Phe Lys Lys Lys Pro Lys Ser Ile Lys
1 5 10

CGA TCG CAT CAA AAT CAA AAA ACA ATC TTA AAG CGT CCT TTA TGG CTT 99
Arg Ser His Gln Asn Gln Lys Thr Ile Leu Lys Arg Pro Leu Trp Leu
15 20 25

ATG CCT TTA CTC ATC AGC GGG TTT GCT AGT GGG GTG TAT GCG AAT AAT 147
Met Pro Leu Leu Ile Ser Gly Phe Ala Ser Gly Val Tyr Ala Asn Asn
30 35 40

CTG TGG GAT TTG TTA AAC CCA AAA GTG GGG GGT GAG TAT GTG CAT TGG 195
Leu Trp Asp Leu Leu Asn Pro Lys Val Gly Gly Glu Tyr Val His Trp
45 50 55 60

GTT AAG GGC AGT CAG TAT TGT GCA TGG TGG GAA TTT GCT GGG TGT TTA 243
Val Lys Gly Ser Gln Tyr Cys Ala Trp Trp Glu Phe Ala Gly Cys Leu
65 70 75

AAG AAT GTA TGG GGG GCA AAT CAT AAA GGC TAT GAT GCT GGA AAC GCC 291
Lys Asn Val Trp Gly Ala Asn His Lys Gly Tyr Asp Ala Gly Asn Ala
80 85 90

GCT AAC TAT TTG TCT TCT CAA AAC TAT CAA GCT ATT TCG GTG GGT AGT 339
Ala Asn Tyr Leu Ser Ser Gln Asn Tyr Gln Ala Ile Ser Val Gly Ser
95 100 105

GGG AAT GAA ACG GGG ACT TAT AGT TTA AGC GGT TTT ACC AAT TAT GTT 387
Gly Asn Glu Thr Gly Thr Tyr Ser Leu Ser Gly Phe Thr Asn Tyr Val
110 115 120

GGG GGC AAT CTC ACG ATC AAT CTA GGC AAT AGC GTT GTT TTA GAT TTA 435
Gly Gly Asn Leu Thr Ile Asn Leu Gly Asn Ser Val Val Leu Asp Leu
125 130 135 140

AGC GGT TCT AAT AGT TTC ACT TCG TAT CAA GGT TAT AAT CAA GGC AAA 483
Ser Gly Ser Asn Ser Phe Thr Ser Tyr Gln Gly Tyr Asn Gln Gly Lys
145 150 155

GAT GAT GTA ACA TTT ACG GTT GGC GCA ATC AAT TTA AAC GGC ACT TTA 531
Asp Asp Val Thr Phe Thr Val Gly Ala Ile Asn Leu Asn Gly Thr Leu
160 165 170

GAA GTG GGT AAT CGT GTG GGA TCG GGA GCT GGC ACG CAC ACC GGC ACA 579
Glu Val Gly Asn Arg Val Gly Ser Gly Ala Gly Thr His Thr Gly Thr
175 180 185

GCC ACT TTA AAC TTG AAC GCT AAT AAG GTC AAT ATC AAT TCC AAT ATC 627
Ala Thr Leu Asn Leu Asn Ala Asn Lys Val Asn Ile Asn Ser Asn Ile
190 195 200

AAC GCG TAT AAA ACT TCG CAA GTG AAT ATA GGC AAC GCT AAC AGC GTT 675
Asn Ala Tyr Lys Thr Ser Gln Val Asn Ile Gly Asn Ala Asn Ser Val
205 210 215 220

ATT ACC ATT GGT TCG GTT TCT TTG AGT GGG GAT GTT TGC AGT TCT TTA 723
Ile Thr Ile Gly Ser Val Ser Leu Ser Gly Asp Val Cys Ser Ser Leu
225 230 235

GCT AGC GTT GGG ATA GGG GCT AAT TGC TCC ACT TCT GGG CCT AGC TAT 771
Ala Ser Val Gly Ile Gly Ala Asn Cys Ser Thr Ser Gly Pro Ser Tyr
240 245 250

TCT TTT AAA GGG ACG ACT AAC GCT ACT AAC ACG GCG TTT AGT AAT GCA 819
Ser Phe Lys Gly Thr Thr Asn Ala Thr Asn Thr Ala Phe Ser Asn Ala
255 260 265

AGC GGC AGT TTC ACT TTT GAA GAG AAC GCC ACT TTT AGC GGG GCG AAA 867
Ser Gly Ser Phe Thr Phe Glu Glu Asn Ala Thr Phe Ser Gly Ala Lys
270 275 280

TGG AAT GGG GGG ACT TAT ACC TTT AAT AAA GAG TTT AGC GCT ACC AAT 915
Trp Asn Gly Gly Thr Tyr Thr Phe Asn Lys Glu Phe Ser Ala Thr Asn
285 290 295 300

AAC ACC GCC TTT AGT AGC GGT AGT TTT AAT TTT AAA GGT GTA AGC TCT 963
Asn Thr Ala Phe Ser Ser Gly Ser Phe Asn Phe Lys Gly Val Ser Ser
305 310 315

TTT AAT GGT ACT TCG TTT AGT AAC GCT TCT TAT ACT TTT GAC AAT CAA 1011
Phe Asn Gly Thr Ser Phe Ser Asn Ala Ser Tyr Thr Phe Asp Asn Gln
320 325 330

GCC ACT TTC CAA AAC AGC TCC TTT AAT GGG GGG ACT TTT ACT TTT AAT 1059
Ala Thr Phe Gln Asn Ser Ser Phe Asn Gly Gly Thr Phe Thr Phe Asn
335 340 345

AAC CAA ACT AAT CCA ACT AAC AAC GCT CAG CAC CCC CAA ATT CAA AAC 1107
Asn Gln Thr Asn Pro Thr Asn Asn Ala Gln His Pro Gln Ile Gln Asn
350 355 360

AGC TCT TTT AGT GGT AAC GCT ACC ACT CTT AAG GGC TTT GTG AAT TTC 1155
Ser Ser Phe Ser Gly Asn Ala Thr Thr Leu Lys Gly Phe Val Asn Phe
365 370 375 380

CAG CAA GCC TTT AAC AAT TCA AAC CAC CAA CTA ACG ATC CAA AAC GCT 1203
Gln Gln Ala Phe Asn Asn Ser Asn His Gln Leu Thr Ile Gln Asn Ala
385 390 395

TCC TTT AAT AAC GCC ACT TTT AAC AAT ACC GGT AAA ATC ACT ATA GAA 1251
Ser Phe Asn Asn Ala Thr Phe Asn Asn Thr Gly Lys Ile Thr Ile Glu
400 405 410

AAA GAT GCG AGT TTT AAT AAC ACG ACA TTC AAC ACT TCT GTT GAT ACA 1299
Lys Asp Ala Ser Phe Asn Asn Thr Thr Phe Asn Thr Ser Val Asp Thr
415 420 425

AAC AAC ATG AGT GTT ACC GGT GGC GTT ACT TTA AGC GGT AAA AAT GAC 1347
Asn Asn Met Ser Val Thr Gly Gly Val Thr Leu Ser Gly Lys Asn Asp
430 435 440

TTG AAA AAT GGC TCA ACC CTT GAT TTT GGG AGT TCT AAA ATC ACT CTC 1395
Leu Lys Asn Gly Ser Thr Leu Asp Phe Gly Ser Ser Lys Ile Thr Leu
445 450 455 460

GCT CAA GGG ACG ACT TTC AAC CTC ACA AGT TTA GGC AGT GAG AAG AGC 1443
Ala Gln Gly Thr Thr Phe Asn Leu Thr Ser Leu Gly Ser Glu Lys Ser
465 470 475

GTA ACG ATT TTA AAT TCT AGC GGT GGG ATC ACT TAT AGT AAC CTT TTA 1491
Val Thr Ile Leu Asn Ser Ser Gly Gly Ile Thr Tyr Ser Asn Leu Leu
480 485 490

AAC CAT GCA ATC AAC GGC TTG ACA AGT GCC TTA AAA ACG AAC GAA AGC 1539
Asn His Ala Ile Asn Gly Leu Thr Ser Ala Leu Lys Thr Asn Glu Ser
495 500 505

CTT TCA AAT CCG CAA AGT TTC GCT CAA GGT TTG TGG GAT ATA ATC ACT 1587
Leu Ser Asn Pro Gln Ser Phe Ala Gln Gly Leu Trp Asp Ile Ile Thr
510 515 520

TAC AAT GGG GTT ACC GGG CAG CTT TTG AAT GAA AAC GCT GCA ACA TCT 1635
Tyr Asn Gly Val Thr Gly Gln Leu Leu Asn Glu Asn Ala Ala Thr Ser
525 530 535 540

AAA CCC ACT GAC TCT TCG CCC TCT AAA TCC TCT ACA AAC TCT ACG CAA 1683
Lys Pro Thr Asp Ser Ser Pro Ser Lys Ser Ser Thr Asn Ser Thr Gln
545 550 555

GTC TAT CAA GTG GGT TAC AAA ATA GGG GAT ACT ATC TAC AAA CTG CAA 1731
Val Tyr Gln Val Gly Tyr Lys Ile Gly Asp Thr Ile Tyr Lys Leu Gln
560 565 570

GAA ACT TTC AGC CAC AAT TCC ATT ATT ATT CAG GCT TTA GAG AGC GGG 1779
Glu Thr Phe Ser His Asn Ser Ile Ile Ile Gln Ala Leu Glu Ser Gly
575 580 585

ACT TAC ACG CCA CCC CCT GTC ATT AAC GGC TCC AAA TTT GAC TTA TCC 1827
Thr Tyr Thr Pro Pro Pro Val Ile Asn Gly Ser Lys Phe Asp Leu Ser
590 595 600

GCT TCA AAT TAT ATC AAT GCT GAC ATG CCT TGG TAT GAC CAT AAA TAT 1875
Ala Ser Asn Tyr Ile Asn Ala Asp Met Pro Trp Tyr Asp His Lys Tyr
605 610 615 620

TAC ATC CCT AAA TCC CAA AAT TTT ACA GAG AGC GGG ACT TAT TAC TTG 1923
Tyr Ile Pro Lys Ser Gln Asn Phe Thr Glu Ser Gly Thr Tyr Tyr Leu
625 630 635

CCG AGC GTC CAA ATA TGG GGG AGC TAC ACT AAC TCG TTT AAA CAA ACT 1971
Pro Ser Val Gln Ile Trp Gly Ser Tyr Thr Asn Ser Phe Lys Gln Thr
640 645 650

TTT AGC GCA AAT GGT AGT AAT CTG GTG ATT GGG TAT AAC TCA ACA TGG 2019
Phe Ser Ala Asn Gly Ser Asn Leu Val Ile Gly Tyr Asn Ser Thr Trp
655 660 665

ACT GAT CAT AAT GTC TCT TCT AGC GGC ACG GTG TCT TTT GGG GAC ACT 2067
Thr Asp His Asn Val Ser Ser Ser Gly Thr Val Ser Phe Gly Asp Thr
670 675 680

TCA GGG AGC GCT CTT AAT GGG CAT TGC GGA CCT TGG CCG TAT TAC CAA 2115
Ser Gly Ser Ala Leu Asn Gly His Cys Gly Pro Trp Pro Tyr Tyr Gln
685 690 695 700

TGC ACA GGC ACG ACT AAC GGC ACT TAT AGC GCC TAT CAT GTG TAT ATC 2163
Cys Thr Gly Thr Thr Asn Gly Thr Tyr Ser Ala Tyr His Val Tyr Ile
705 710 715

ACA GCG AAT CTG CGT TCT GGC AAT CGT ATA GGC ACC GGT GGG GCA GCT 2211
Thr Ala Asn Leu Arg Ser Gly Asn Arg Ile Gly Thr Gly Gly Ala Ala
720 725 730

AAT CTA ATC TTT AAT GGG GTA GAT AGT ATC AAT ATC GCT AAC GCT ACC 2259
Asn Leu Ile Phe Asn Gly Val Asp Ser Ile Asn Ile Ala Asn Ala Thr
735 740 745

ATC ACG CAA CAT AAC GCC GGA ATC TAT TCA AGC TCT ATG ACT TTT TCC 2307
Ile Thr Gln His Asn Ala Gly Ile Tyr Ser Ser Ser Met Thr Phe Ser
750 755 760

ACG CAA AGC ATG GAT AAT TCG CAG AAT TTG AAT GGT CTA AAT TCT AAC 2355
Thr Gln Ser Met Asp Asn Ser Gln Asn Leu Asn Gly Leu Asn Ser Asn
765 770 775 780

GGC AAA CTT TCG GTG TAT GGC ACC ACT TTC ACT AAC GAA GCT AAA GAT 2403
Gly Lys Leu Ser Val Tyr Gly Thr Thr Phe Thr Asn Glu Ala Lys Asp
785 790 795

GGG AAA TTC ATT TTC AAT GCA GGG CAA GCG GTT TTT GAA AAC ACC AAC 2451
Gly Lys Phe Ile Phe Asn Ala Gly Gln Ala Val Phe Glu Asn Thr Asn
800 805 810

TTT AAT GGA GGG AGT TAC CAA TTC AGC GGC GAT AGC TTG AAT TTT TCA 2499
Phe Asn Gly Gly Ser Tyr Gln Phe Ser Gly Asp Ser Leu Asn Phe Ser
815 820 825

AAC AAC AAC CAG TTC AAT AGC GGT TCG TTT GAA ATT AGC GCA AAA AAC 2547
Asn Asn Asn Gln Phe Asn Ser Gly Ser Phe Glu Ile Ser Ala Lys Asn
830 835 840

GCT TCG TTC AAT AAC GCT AAC TTT AAC AAC AGC GCT TCT TTT AAT TTC 2595
Ala Ser Phe Asn Asn Ala Asn Phe Asn Asn Ser Ala Ser Phe Asn Phe
845 850 855 860

AAT AAT TCT AAC GCG ACC ACT TCG TTT GTG GGG GAT TTC ACT AAC GCT 2643
Asn Asn Ser Asn Ala Thr Thr Ser Phe Val Gly Asp Phe Thr Asn Ala
865 870 875

AAT TCA AAT TTG CAA ATC GCC GGG AAC GCT GTT TTT GGG AAC TCT ACT 2691
Asn Ser Asn Leu Gln Ile Ala Gly Asn Ala Val Phe Gly Asn Ser Thr
880 885 890

AAT GGC TCT CAA AAT ACC GCT AAT TTT AAT AAT ACC GGC TCT GTT AAT 2739
Asn Gly Ser Gln Asn Thr Ala Asn Phe Asn Asn Thr Gly Ser Val Asn
895 900 905

ATT TCA GGG AAT GCA ACC TTT GAT AAT GTG GTG TTT AAT GGC CCT ACG 2787
Ile Ser Gly Asn Ala Thr Phe Asp Asn Val Val Phe Asn Gly Pro Thr
910 915 920

AAC ACG AGC GTG AAA GGG CAG GTT ACT TTA AAT AAC ATC ACT TTA AAA 2835
Asn Thr Ser Val Lys Gly Gln Val Thr Leu Asn Asn Ile Thr Leu Lys
925 930 935 940

AAC CTG AAC GCC CCT TTG TCT TTT GGC GAT GGG ACG ATT ACT TTT AAC 2883
Asn Leu Asn Ala Pro Leu Ser Phe Gly Asp Gly Thr Ile Thr Phe Asn
945 950 955

GCT CAT TCG GTG ATT AAT ATT GCT GAA TCT ATC ACT AAT GGC AAC CCT 2931
Ala His Ser Val Ile Asn Ile Ala Glu Ser Ile Thr Asn Gly Asn Pro
960 965 970

ATC ACT CTT GTA AGC TCT TCT AAA GAA ATT GAA TAC AAC AAC GCT TTC 2979
Ile Thr Leu Val Ser Ser Ser Lys Glu Ile Glu Tyr Asn Asn Ala Phe
975 980 985

AGT AAA AAT CTA TGG CAG CTC ATC AAC TAC CAA GGG CAT GGG GCA AGC 3027
Ser Lys Asn Leu Trp Gln Leu Ile Asn Tyr Gln Gly His Gly Ala Ser
990 995 1000

AGT GAA AAG CTC GTC TCT AGC GCG GGT AAT GGC GTT TAT GAT GTG GTG 3075
Ser Glu Lys Leu Val Ser Ser Ala Gly Asn Gly Val Tyr Asp Val Val
1005 1010 1015 1020

TAT TCT TTC AAT AAC CAA ACC TAC AAT TTC CAA GAG GTT TTT TCA CAA 3123
Tyr Ser Phe Asn Asn Gln Thr Tyr Asn Phe Gln Glu Val Phe Ser Gln
1025 1030 1035

AAC AGC ATT TCT ATC CGG CGT TTG GGC GTT AAC ATG GTG TTT GAT TAT 3171
Asn Ser Ile Ser Ile Arg Arg Leu Gly Val Asn Met Val Phe Asp Tyr
1040 1045 1050

GTG GAT ATG GAA AAA TCG GAT CAT TTA TAT TAT CAA AAC GCT CTC GGT 3219
Val Asp Met Glu Lys Ser Asp His Leu Tyr Tyr Gln Asn Ala Leu Gly
1055 1060 1065

TTT ATG ACC TAC ATG CCT AAT AGC TAT AAC AAT AAT TTA GGG AAT GCA 3267
Phe Met Thr Tyr Met Pro Asn Ser Tyr Asn Asn Asn Leu Gly Asn Ala
1070 1075 1080

AAC AAC ACC ATT TAC TAT TAC GAC AAG AGC ATT GAT TTT TAT GCG AGC 3315
Asn Asn Thr Ile Tyr Tyr Tyr Asp Lys Ser Ile Asp Phe Tyr Ala Ser
1085 1090 1095 1100

GGG AAA ACT CTA TTC ACT AAA GCG GAA TTT TCT CAA ACA TTC ACC GGG 3363
Gly Lys Thr Leu Phe Thr Lys Ala Glu Phe Ser Gln Thr Phe Thr Gly
1105 1110 1115

CAA AAC AGC GCG ATC GTT TTT GGG GCT AAA AGC ATA TGG ACG AGC TTA 3411
Gln Asn Ser Ala Ile Val Phe Gly Ala Lys Ser Ile Trp Thr Ser Leu
1120 1125 1130

AGC GAT GCA CCG CAG TCT AAC ACC ATC ATT CGC TTT GGG GAC AAT AAG 3459
Ser Asp Ala Pro Gln Ser Asn Thr Ile Ile Arg Phe Gly Asp Asn Lys
1135 1140 1145

GGA GCA GGG AGT AAT GAT GCG AGC GGG CAT TGC TGG AAT TTG CAA TGC 3507
Gly Ala Gly Ser Asn Asp Ala Ser Gly His Cys Trp Asn Leu Gln Cys
1150 1155 1160

ATA GGC TTT ATT ACA GGG CAT TAT GAA GCG CAA AAG ATT TAC ATC ACC 3555
Ile Gly Phe Ile Thr Gly His Tyr Glu Ala Gln Lys Ile Tyr Ile Thr
1165 1170 1175 1180

GGT AGC ATT GAA AGC GGG AAT CGC ATT TCT AGC GGT GGG GGC GCG AGC 3603
Gly Ser Ile Glu Ser Gly Asn Arg Ile Ser Ser Gly Gly Gly Ala Ser
1185 1190 1195

CTT AAT TTT AAC GGG CTT CAA GGC ATT CTT TTA ACG AAC GCG ACT TTG 3651
Leu Asn Phe Asn Gly Leu Gln Gly Ile Leu Leu Thr Asn Ala Thr Leu
1200 1205 1210

TAT AAC CGC GCC GCT GGC ACG CAA AGC TCG TCT ATG AAT TTT ATC TCT 3699
Tyr Asn Arg Ala Ala Gly Thr Gln Ser Ser Ser Met Asn Phe Ile Ser
1215 1220 1225

AAC AGC GCG AAC ATT CAG GCT CAA AAC TCC TAT TTT ATA GAC GAT ACC 3747
Asn Ser Ala Asn Ile Gln Ala Gln Asn Ser Tyr Phe Ile Asp Asp Thr
1230 1235 1240

GCA CAA AAT GGC GGT AAC CCT AAT TTC AGT TTC AAC GCT TTG AAT CTG 3795
Ala Gln Asn Gly Gly Asn Pro Asn Phe Ser Phe Asn Ala Leu Asn Leu
1245 1250 1255 1260

GAT TTT TCT AAC AGC TCT TTT AGA GGC TAT GTG GGG AAA ACG CAA TCT 3843
Asp Phe Ser Asn Ser Ser Phe Arg Gly Tyr Val Gly Lys Thr Gln Ser
1265 1270 1275

GTT TTT AAA TTC AAT GCC AAG AAT GCG ATC AGT TTC ACC AAC AGC ACG 3891
Val Phe Lys Phe Asn Ala Lys Asn Ala Ile Ser Phe Thr Asn Ser Thr
1280 1285 1290

AAT TTA AGC TCT GGT TTG TAT CAA ATG CAA GCT AAA AGC GTG TTG TTT 3939
Asn Leu Ser Ser Gly Leu Tyr Gln Met Gln Ala Lys Ser Val Leu Phe
1295 1300 1305

GAC AAT TCC AAT TTA AGC GTT TCA GTG GGG ACA AGC AGT ATT AAA GCC 3987
Asp Asn Ser Asn Leu Ser Val Ser Val Gly Thr Ser Ser Ile Lys Ala
1310 1315 1320

AAT GCG ATC AAT CTT TCT CAA AAT GCC TCT ATT AAT GCG AGC AAC CAT 4035
Asn Ala Ile Asn Leu Ser Gln Asn Ala Ser Ile Asn Ala Ser Asn His
1325 1330 1335 1340

TCA ACC TTA GAA CTT CAA GGC GAT TTG AAT GTG AAC GAC ACC AGC TCG 4083
Ser Thr Leu Glu Leu Gln Gly Asp Leu Asn Val Asn Asp Thr Ser Ser
1345 1350 1355

CTC AAC CTC AAC CAA AGC ACG ATT AAT GTT TCC AAT AAC GCC ACG ATC 4131
Leu Asn Leu Asn Gln Ser Thr Ile Asn Val Ser Asn Asn Ala Thr Ile
1360 1365 1370

AAC GAT TAT GCG AGC TTG ATT GCG AGT AAT GGC TCT CAC CTT AAT TTT 4179
Asn Asp Tyr Ala Ser Leu Ile Ala Ser Asn Gly Ser His Leu Asn Phe
1375 1380 1385

AAC GGG GCG GTT AAT TTC AAT TCA GCG AAT ATT ACT ACG AGT TTG AAT 4227
Asn Gly Ala Val Asn Phe Asn Ser Ala Asn Ile Thr Thr Ser Leu Asn
1390 1395 1400

AAT TCC TCT ATC GTG TTT AAG GGG GCG GTC TCT TTA GGA GGG CAG TTT 4275
Asn Ser Ser Ile Val Phe Lys Gly Ala Val Ser Leu Gly Gly Gln Phe
1405 1410 1415 1420

AAT TTA AGC AAT AAC TCT TCT TTA GAT TTC CAA GGC TCT AGC GCT ATC 4323
Asn Leu Ser Asn Asn Ser Ser Leu Asp Phe Gln Gly Ser Ser Ala Ile
1425 1430 1435

ACC TCT AAC ACG GCG TTT AAT TTC TAT GAT AAC GCT TTT TCT CAA AGC 4371
Thr Ser Asn Thr Ala Phe Asn Phe Tyr Asp Asn Ala Phe Ser Gln Ser
1440 1445 1450

CCC ATC ACT TTC CAT CAA GCC CTT GAC ATT AAA GCG CCC TTA AGT TTG 4419
Pro Ile Thr Phe His Gln Ala Leu Asp Ile Lys Ala Pro Leu Ser Leu
1455 1460 1465

GGA GGC AAC CTT TTA AAC CCT AAC AAC AGC AGC GTG CTG GAT TTA AAA 4467
Gly Gly Asn Leu Leu Asn Pro Asn Asn Ser Ser Val Leu Asp Leu Lys
1470 1475 1480

AAC AGC CAG CTT GTT TTT GGC GAT CAA GGG AGT TTG AAT ATC GCT AAC 4515
Asn Ser Gln Leu Val Phe Gly Asp Gln Gly Ser Leu Asn Ile Ala Asn
1485 1490 1495 1500

ATT GAT TTA CTA AGC GAT CTA AAT GAT AAT AAA AAT CGT GTG TAT AAC 4563
Ile Asp Leu Leu Ser Asp Leu Asn Asp Asn Lys Asn Arg Val Tyr Asn
1505 1510 1515

ATC ATT CAA GCG GAL' ATG AAT AGT AAT TGG TAT GAG CGT ATC AGC TTC 4611
Ile Ile Gln Ala Asp Met Asn Ser Asn Trp Tyr Glu Arg Ile Ser Phe
1520 1525 1530

TTT GGC ATG CAC ATC AAT GAC GGG ATT TAT GAT GCT AAA AAC CAA ACT 4659
Phe Gly Met His Ile Asn Asp Gly Ile Tyr Asp Ala Lys Asn Gln Thr
1535 1540 1545

TAT AGT TTC ACT AAC CCC CTT AAT AAC GCC CTA AAA ATC ACC GAG AGC 4707
Tyr Ser Phe Thr Asn Pro Leu Asn Asn Ala Leu Lys Ile Thr Glu Ser
1550 1555 1560

TTT AAA GAC AAC CAA CTA AGC GTT ACG CTC TCT CAA ATC CCG GGT ATT 4755
Phe Lys Asp Asn Gln Leu Ser Val Thr Leu Ser Gln Ile Pro Gly Ile
1565 1570 1575 1580

AAA AAC ACG CTC TAT AAC ATT GGC TCT GAA ATT TTT AAC TAC CAA AAA 4803
Lys Asn Thr Leu Tyr Asn Ile Gly Ser Glu Ile Phe Asn Tyr Gln Lys
1585 1590 1595

GTT TAT AAC AAC GCT AAT GGC GTG TAT TCT TAT AGC GAT GAT GCA CAA 4851
Val Tyr Asn Asn Ala Asn Gly Val Tyr Ser Tyr Ser Asp Asp Ala Gln
1600 1605 1610

GGC GTG TTT TAT CTC ACA AGC AAC GTG AAA GGC TAT TAC AAC CCT AAC 4899
Gly Val Phe Tyr Leu Thr Ser Asn Val Lys Gly Tyr Tyr Asn Pro Asn
1615 1620 1625

CAA TCC TAT CAA GCC AGC GGC AGT AAC AAC ACC ACG AAA AAT AAT AAT 4947
Gln Ser Tyr Gln Ala Ser Gly Ser Asn Asn Thr Thr Lys Asn Asn Asn
1630 1635 1640

CTA ACC TCT GAA TCT TCT ATC ATC TCG CAA ACC TAT AAC GCG CAA GGC 4995
Leu Thr Ser Glu Ser Ser Ile Ile Ser Gln Thr Tyr Asn Ala Gln Gly
1645 1650 1655 1660

AAC CCT ATT AGC GCG TTG CAC ATC TAT AAC AAG GGC TAT AAT TTC AAC 5043
Asn Pro Ile Ser Ala Leu His Ile Tyr Asn Lys Gly Tyr Asn Phe Asn
1665 1670 1675

AAT ATC AAA GCG TTA GGG CAA ATG GCT CTC AAA CTC TAC CCT GAA ATC 5091
Asn Ile Lys Ala Leu Gly Gln Met Ala Leu Lys Leu Tyr Pro Glu Ile
1680 1685 1690

AAA AAG GTA TTA GGG AAT GAT TTT TCG CCC TCA AGT TTG AAC GCT TTA 5139
Lys Lys Val Leu Gly Asn Asp Phe Ser Pro Ser Ser Leu Asn Ala Leu
1695 1700 1705

AAC TCT AAT GCG CTA AAC CAA CTT ACC AAA CTC ATC ACG CCT AAC GAC 5187
Asn Ser Asn Ala Leu Asn Gln Leu Thr Lys Leu Ile Thr Pro Asn Asp
1710 1715 1720

TGG AAA AAC ATT AAC GAG TTG ATT GAT AAC GCA AAC AAT TCG GTG GTG 52 3 5 

Trp Lys Asn He Asn Glu Leu He Asp Asn Ala Asn Asn Ser Val Val 
1725 1730 1735 1740 
CAA AAT TTC AAT AAC GGC ACT TTG ATT GTG GGA GCG ACT CAA ATA GGG 5283

Gln Asn Phe Asn Asn Gly Thr Leu Ile Val Gly Ala Thr Gln Ile Gly
1745 1750 1755 



CAA ACA GAC ACC AAT AGC GCG GTT GTT TTT GGG GGC TTG GGC TAT CAA 5331 
Gin Thr Asp Thr Asn Ser Ala Val Val Phe Gly Gly Leu Gly Tyr Gin 
1760 1765 1770 



ACA CCT TGT GAT TAT ACT GAT ATT GTG TGC CAA AAA TTT AGA GGC ACT 537 9 

Thr Pro Cys Asp Tyr Thr Asp lie Val Cys Gin Lys Phe Arg Gly Thr 
1775 1780 1785 

TAT TTA GGA CAG CTT TTA GAG TCC AGC TCG GCT GAT TTG GGC TAT ATT 54 2 7 

Tyr Leu Gly Gin Leu Leu Glu Ser Ser Ser Ala Asp Leu Gly Tyr lie 
1790 1795 1800 



GAC ACG ACT TTT AAC GCT AAA GAA ATT TAT CTT ACC GGC ACT TTA GGG 54 7 5 

Asp Thr Thr Phe Asn Ala Lys Glu lie Tyr Leu Thr Gly Thr Leu Gly 
1805 1810 1815 1820 

AGC GGG AAC GCA TGG GGG ACT GGG GGG AGC GCG AGC GTA ACT TTT AAC 552 3 

Ser Gly Asn Ala Trp Gly Thr Gly Gly Ser Ala Ser Val Thr Phe Asn 
1825 1830 1835 

AGC CAA ACT TCG CTC ATT CTC AAT CAG GCT AAT ATC GTA AGC TCG CAA 5571 

Ser Gin Thr Ser Leu lie Leu Asn Gin Ala Asn lie Val Ser Ser Gin 
1840 1845 1850 



ACC GAT GGG ATC TTT AGC ATG CTG GGT CAA GAG GGT ATT AAT AAG GTT 5 619 

Thr Asp Gly lie Phe Ser Met Leu Gly Gin Glu Gly lie Asn Lys Val 
1855 1860 1865 

TTC AAT CAA GCC GGG CTC GCT AAT ATT TTG GGC GAA GTG GCG GTG CAA 5 6 67 

Phe Asn Gin Ala Gly Leu Ala Asn lie Leu Gly Glu Val Ala Val Gin 
1870 1875 1880 

TCC ATC AAC AAA GCC GGG GGA TTA GGG AAT TTG ATA GTA AAT ACG CTA 5 715 

Ser lie Asn Lys Ala Gly Gly Leu Gly Asn Leu lie Val Asn Thr Leu 
1885 1890 1895 1900 

GGG AGT AAT AGC GTG ATT GGG GGG TAT TTA ACG CCT GAA CAA AAA AAT 5 7 63 

Gly Ser Asn Ser Val lie Gly Gly Tyr Leu Thr Pro Glu Gin Lys Asn 
1905 1910 1915 

CAA ACC CTA AGC CAG CTT TTA GGG CAG AAT AAC TTT GAT AAT CTC ATG 5811 
Gin Thr Leu Ser Gin Leu Leu Gly Gin Asn Asn Phe Asp Asn Leu Met 
1920 1925 1930 

AAC GAT AGC GGT TTG AAT ACG GCG ATT AAG GAT TTG ATC AGA CAA AAA 5 85 9 

Asn Asp Ser Gly Leu Asn Thr Ala lie Lys Asp Leu lie Arg Gin Lys 
1935 1940 1945 

TTA GGC TTT TGG ACC GGG CTA GTG GGG GGA TTA GCC GGA CTA GGG GGC 5 907 

Leu Gly Phe Trp Thr Gly Leu Val Gly Gly Leu Ala Gly Leu Gly Gly 
1950 1955 1960 
ATT GAT TTG CAA AAT CCT GAA AAG CTT ATA GGC AGC ATG TCA ATC AAT 5955

Ile Asp Leu Gln Asn Pro Glu Lys Leu Ile Gly Ser Met Ser Ile Asn
1965 1970 1975 1980

GAT TTA TTG AGT AAA AAA GGG TTG TTC AAT CAG ATC ACC GGC TTT ATT 6 003 

Asp Leu Leu Ser Lys Lys Gly Leu Phe Asn Gin He Thr Gly Phe He 
1985 1990 1995 

TCC GCT AAC GAT ATA GGG CAA GTC ATA AGC GTA ATG TTG CAA GAT ATT 6 051 

Ser Ala Asn Asp He Gly Gin Val He Ser Val Met Leu Gin Asp He 

2000 2005 2010 

GTC AAA CCG AGC AAC GCT TTA AAA AAC GAT GTA GCG GCT TTA GGC AAG 6 0 99 

Val Lys Pro Ser Asn Ala Leu Lys Asn Asp Val Ala Ala Leu Gly Lys 
2015 2020 2025 

CAA ATG ATT GGC GAA TTT TTA GGC CAA GAC ACG CTC AAT TCT TTA GAA 614 7 

Gin Met He Gly Glu Phe Leu Gly Gin Asp Thr Leu Asn Ser Leu Glu 
2030 2035 2040 

AGC TTG TTG CAA AAC CAG CAG ATT AAA AGC GTT TTA GAC AAA GTC CTA 6195 
Ser Leu Leu Gin Asn Gin Gin He Lys Ser Val Leu Asp Lys Val Leu 
2045 2050 2055 2060 

GCG GCT AAA GGT TTA GGG CCT ATT TAT GAA CAA GGC TTG GGG GAT TTG 624 3 

Ala Ala Lys Gly Leu Gly Pro He Tyr Glu Gin Gly Leu Gly Asp Leu 
2065 2070 2075 

ATA CCT AAT CTT GGT AAA AAA GGG CTT TTC GCT CCT TAT GGC TTG AGT 62 91 

He Pro Asn Leu Gly Lys Lys Gly Leu Phe Ala Pro Tyr Gly Leu Ser 
2080 2085 2090 

CAA GTG TGG CAA AAA GGG GAT TTT AGT TTC AAC GCA CAA GGC AAT GTT 63 3 9 

Gin Val Trp Gin Lys Gly Asp Phe Ser Phe Asn Ala Gin Gly Asn Val 
2095 2100 2105 

TTT GTG CAA AAT TCC ACT TTC TCT AAC GCC AAT GGA GGC ACG CTC TCT 6 3 87 

Phe Val Gin Asn Ser Thr Phe Ser Asn Ala Asn Gly Gly Thr Leu Ser 
2110 2115 2120 

TTT AAC GCA GGA AAT TCG CTC ATT TTT GCC GGA AAC AAT CAT ATT GCA 64 3 5 

Phe Asn Ala Gly Asn Ser Leu He Phe Ala Gly Asn Asn His He Ala 
2125 2130 2135 2140 

TTC ACT AAC CAC GCT GGA ACT CTT CAA TTA TTG TCC GAT CAA GTT TCT 64 8 3 

Phe Thr Asn His Ala Gly Thr Leu Gin Leu Leu Ser Asp Gin Val Ser 
2145 2150 2155 

AAC ATT AAC ATC ACC ACG CTT AAC GCT AGC AAC GGC CTT AAG ATT AAC 65 31 

Asn He Asn He Thr Thr Leu Asn Ala Ser Asn Gly Leu Lys lie Asn 
2160 2165 2170 

GCC GCT AAT AAC AAT GTT TCT GTG TCT CAA GGC AAT CTG TTT GTC AGC 657 9 

Ala Ala Asn Asn Asn Val Ser Val Ser Gin Gly Asn Leu Phe Val Ser 
2175 2180 2185 
GCT AGC TGC GCG CAA CAA AGC GAT CCA ACT ACA GCT AAT ATT GCA AAC 6627

Ala Ser Cys Ala Gln Gln Ser Asp Pro Thr Thr Ala Asn Ile Ala Asn
2190 2195 2200 

CCT TGC GCG CTT AGC GCC CAA AGC ACG AAT GGC GCT TCT TCT AAT AAT 6 67 5 

Pro Cys Ala Leu Ser Ala Gin Ser Thr Asn Gly Ala Ser Ser Asn Asn 
2205 2210 2215 2220 

GCG TCA AAT AAC GCG CCA ATC GCC TTG AGT AAT AAC GAT GAA AGC TTG 6 72 3 

Ala Ser Asn Asn Ala Pro lie Ala Leu Ser Asn Asn Asp Glu Ser Leu 
2225 2230 2235 

ATG GTT GCG GCG AAT GAT TTC AAT TTT TCA GGC AAT ATT TAC GCT AAT 6771 
Met Val Ala Ala Asn Asp Phe Asn Phe Ser Gly Asn lie Tyr Ala Asn 
2240 2245 2250 

GGG GTG GTT GAT TTT TCA AAG ATT AAA GGC TCT GCA AAC ATT AAA AAC 6 819 

Gly Val Val Asp Phe Ser Lys lie Lys Gly Ser Ala Asn lie Lys Asn 
2255 2260 2265 

CTG TAT CTT TAC AAT AAC GCT CAA TTC CAA GCC AAC AAT CTC ACT ATT 6 86 7 

Leu Tyr Leu Tyr Asn Asn Ala Gin Phe Gin Ala Asn Asn Leu Thr lie 
2270 2275 2280 

TCC AAT CAA GCG GTG TTA GAA AAA AAC GCC AGC TTT GTA ACG AAT AAT 6 915 

Ser Asn Gin Ala Val Leu Glu Lys Asn Ala Ser Phe Val Thr Asn Asn 
2285 2290 2295 2300 

TTA AAC ATT CAA GGA GCG TTT AAC AAC AAC GCC ACG CAA AAA ATA GAG 6 96 3 

Leu Asn lie Gin Gly Ala Phe Asn Asn Asn Ala Thr Gin Lys lie Glu 
2305 2310 2315 

GTG CTT CAA AAT TTA GTG ATC GCT TCA AAC GCT TCT TTA AGC ACC GGG 7 011 

Val Leu Gin Asn Leu Val lie Ala Ser Asn Ala Ser Leu Ser Thr Gly 
2320 2325 2330 

ATT TAT GGG TTA GAA GTA GGG GGG GCT TTG AAT AAT TCT GGA GCG ATC 705 9 

lie Tyr Gly Leu Glu Val Gly Gly Ala Leu Asn Asn Ser Gly Ala lie 
2335 2340 2345 

CAT TTT AAT TTA GAA AAT ACC CAA ACG CCA ACG CCG CTC ATT CAA GCA 710 7 

His Phe Asn Leu Glu Asn Thr Gin Thr Pro Thr Pro Leu lie Gin Ala 
2350 2355 2360 

GAG GGG ATC ATT AAC CTC AAC ACC ACC CAA ACG CCT TTT ATG AAT GTC 7155 
Glu Gly lie lie Asn Leu Asn Thr Thr Gin Thr Pro Phe Met Asn Val 
2365 2370 2375 2380 

AAT AAC AGC ATG GCC AAT AAT ACG ACT TAC ACT TTA TTA AAA AGC AGC 72 0 3 

Asn Asn Ser Met Ala Asn Asn Thr Thr Tyr Thr Leu Leu Lys Ser Ser 
2385 2390 2395 

CGT TAC ATT GAT TAC AAT ATC AAC CCC AAC AGC TTG CAA TCG TAT TTG 72 51 

Arg Tyr lie Asp Tyr Asn lie Asn Pro Asn Ser Leu Gin Ser Tyr Leu 
2400 2405 2410 
AAT CTC TAC ACT TTA ATC AAT ATC AAC GGG AAC CAC ATA GAG GAA AAA 7299

Asn Leu Tyr Thr Leu Ile Asn Ile Asn Gly Asn His Ile Glu Glu Lys
2415 2420 2425 

AAC GGC GCA TTG ACT TAT TTG GGC CAA CGG GTT TTG TTG CAA GAT AAG 734 7 

Asn Gly Ala Leu Thr Tyr Leu Gly Gin Arg Val Leu Leu Gin Asp Lys 
2430 2435 2440 

GGG TTA TTG TTA AGC GTA GCG CTG CCC AAC TCA AAC AAC GCT TCT CAA 73 95 

Gly Leu Leu Leu Ser Val Ala Leu Pro Asn Ser Asn Asn Ala Ser Gin 
2445 2450 2455 2460 

AAC AAC ATT TTA AGC CTT TCT GTC CTT TAT AAC CAA GTT AAA ATG TCT 74 43 

Asn Asn He Leu Ser Leu Ser Val Leu Tyr Asn Gin Val Lys Met Ser 
2465 2470 2475 

TGC GGC GAT AAA GCG ATG GAT TTT ACC CCC CCT ACC TTA CAA GAT TAC 74 91 

Cys Gly Asp Lys Ala Met Asp Phe Thr Pro Pro Thr Leu Gin Asp Tyr 
2480 2485 2490 

ATT GTG GGC ATT CAA GGG CAA AGC GCG CTC AAT CAA ATT GAA GCT GTT 7 53 9 

He Val Gly He Gin Gly Gin Ser Ala Leu Asn Gin He Glu Ala Val 
2495 2500 2505 

GGG GGG AAC GCT ATC AAG TGG CTT TCA ACA TTG ATG ATG GAG ACT AAA 7 5 87 

Gly Gly Asn Ala He Lys Trp Leu Ser Thr Leu Met Met Glu Thr Lys 
2510 2515 2520 

GAA AAC CCG TTT TTT GCG CCG ATT TAT TTA AAA AAC CAC TCT TTG AAT 7 63 5 

Glu Asn Pro Phe Phe Ala Pro He Tyr Leu Lys Asn His Ser Leu Asn 
2525 2530 2535 2540 

GAA ATC TTA GGC GTA ACA AAA GAT CTT CAA AAC ACC GCA AGC TTG ATT 7 683 

Glu He Leu Gly Val Thr Lys Asp Leu Gin Asn Thr Ala Ser Leu He 
2545 2550 2555 

TCT AAC CCT AAT TTT AGA GAT AAC GCT ACC AAT CTT TTA GAA TTG GCG 7 731 

Ser Asn Pro Asn Phe Arg Asp Asn Ala Thr Asn Leu Leu Glu Leu Ala 
2560 2565 2570 

AGT TAC ACC CAA CAA ACC AGC CGT TTA ACA AAA CTC TCT GAT TTT AGA 77 7 9 

Ser Tyr Thr Gin Gin Thr Ser Arg Leu Thr Lys Leu Ser Asp Phe Arg 
2575 2580 2585 

TCT AGA GAG GGA GAG TCT GAT TTT TCT TTG TTA GAG CTT AAA AAC AAG 7 827 

Ser Arg Glu Gly Glu Ser Asp Phe Ser Leu Leu Glu Leu Lys Asn Lys 
2590 2595 2600 

CGT TTT AGC GAT CCT AAT CCA GAG GTT TTT GTC AAA TAC TCT CAA CTT 7 8 75 

Arg Phe Ser Asp Pro Asn Pro Glu Val Phe Val Lys Tyr Ser Gin Leu 
2605 2610 2615 2620 

AGC AAA CAC CCA AAT AAC CTT TGG GTT CAA GGG GTG GGA GGA GCG AGC 7 92 3 

Ser Lys His Pro Asn Asn Leu Trp Val Gin Gly Val Gly Gly Ala Ser 
2625 2630 2635 



TTT ATT TCT GGG GGC AAT GGC ACG CTT TAT GGC TTG AAT GCG GGC TAT 7971

Phe Ile Ser Gly Gly Asn Gly Thr Leu Tyr Gly Leu Asn Ala Gly Tyr
2640 2645 2650

GAC AGG TTG GTT AAA AAT GTG ATC CTT GGG GGT TAT GTG GCT TAT GGC 8 019 

Asp Arg Leu Val Lys Asn Val lie Leu Gly Gly Tyr Val Ala Tyr Gly 
2655 2660 2665 

TAT AGC GAC TTT AAT GGG AAC ATC ATG CAT TCT TTG GGT AAT AAT GTG 8 0 67 

Tyr Ser Asp Phe Asn Gly Asn lie Met His Ser Leu Gly Asn Asn Val 
2670 2675 2680 

GAT GTG GGG ATG TAT GCG AGG GCT TTT TTA AAA AGG AAC GAA TTC ACT 8115 
Asp Val Gly Met Tyr Ala Arg Ala Phe Leu Lys Arg Asn Glu Phe Thr 
2685 2690 2695 2700 

TTG AGC GCG AAT GAA ACT TAT GGA GGC AAT GCA ACT AGT ATC AAT TCT 816 3 

Leu Ser Ala Asn Glu Thr Tyr Gly Gly Asn Ala Thr Ser He Asn Ser 
2705 2710 2715 

TCT AAT TCT TTG CTC TCT GTG TTG AAC CAA CGC TAC AAC TAC AAC ACC 8211 
Ser Asn Ser Leu Leu Ser Val Leu Asn Gin Arg Tyr Asn Tyr Asn Thr 
2720 2725 2730 

TGG ACA ACG AGC GTG AAC GGG AAT TAC GGC TAT GAT TTC ATG TTC AAA 82 5 9 

Trp Thr Thr Ser Val Asn Gly Asn Tyr Gly Tyr Asp Phe Met Phe Lys 
2735 2740 2745 

CAA AAA AGC GTG GTG CTA AAA CCT CAA GTG GGT TTG AGC TAT CAT TTC 83 0 7 

Gin Lys Ser Val Val Leu Lys Pro Gin Val Gly Leu Ser Tyr His Phe 
2750 2755 2760 

ATA GGT CTA AGT GGG ATG AAA GGC AAT GAT GCC GCT TAC AAA CAA TTC 8 3 55 

He Gly Leu Ser Gly Met Lys Gly Asn Asp Ala Ala Tyr Lys Gin Phe 
2765 2770 2775 2780 

CTC ATG CAT TCA AAC CCC TCT AAC GAA TCG GTT TTA ACG CTC AAC ATG 84 03 

Leu Met His Ser Asn Pro Ser Asn Glu Ser Val Leu Thr Leu Asn Met 
2785 2790 2795 

GGG TTG GAG AGC CGT AAA TAT TTT GGT AAA AAT TCC TAT TAT TTT GTA 84 51 

Gly Leu Glu Ser Arg Lys Tyr Phe Gly Lys Asn Ser Tyr Tyr Phe Val 
2800 2805 2810 

ACG GCG AGA CTA GGT AGG GAT CTT TTG ATC AAA TCT AAA GGC AGC AAT 84 99 

Thr Ala Arg Leu Gly Arg Asp Leu Leu He Lys Ser Lys Gly Ser Asn 
2815 2820 2825 

ACG GTG CGT TTT GTG GGC GAA AAC ACT TTA TTG TAT CGC AAG GGG GAA 854 7 

Thr Val Arg Phe Val Gly Glu Asn Thr Leu Leu Tyr Arg Lys Gly Glu 
2830 2835 2840 

GTT TTT AAC ACT TTT GCG AGC GTG ATT ACA GGG GGC GAA ATG CAT TTG 8 5 95 

Val Phe Asn Thr Phe Ala Ser Val lie Thr Gly Gly Glu Met His Leu 
2845 2850 2855 2860 
TGG CGT TTG GTG TAT GTG AAT GCG GGG GTG GGG CTT AAG ATG GGC TTG 8643

Trp Arg Leu Val Tyr Val Asn Ala Gly Val Gly Leu Lys Met Gly Leu
2865 2870 2875 

CAA TAC CAA GAT ATT AAT ATA ACC GGG AAT GTG GGC ATG CGA GTG GCG 8691 

Gln Tyr Gln Asp Ile Asn Ile Thr Gly Asn Val Gly Met Arg Val Ala
2880 2885 2890

TTT TAGCTTTTTT GCTATAATGC TTCGTTCAAA TTTTATGGTT AGGTTTTTCT ATGT 8748

Phe



(2) INFORMATION FOR SEQ ID NO: 166:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2893 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 
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He 
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He 
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Trp 
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35 
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Trp 
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Trp 
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Trp 
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Ala 
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Tyr 
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Asn 
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Asn 
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He 
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Glu 


Thr 
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He 
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Tyr 


Asn 
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He 
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Ala 
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180 










185 
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Leu 
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He 
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He 
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Ala 
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He 
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He 
Ile Gly Ala Asn Cys Ser Thr Ser Gly Pro Ser Tyr Ser Phe Lys Gly

245 250 255 

Thr Thr Asn Ala Thr Asn Thr Ala Phe Ser Asn Ala Ser Gly Ser Phe 

260 265 270 

Thr Phe Glu Glu Asn Ala Thr Phe Ser Gly Ala Lys Trp Asn Gly Gly 

275 280 285 

Thr Tyr Thr Phe Asn Lys Glu Phe Ser Ala Thr Asn Asn Thr Ala Phe 

290 295 300 

Ser Ser Gly Ser Phe Asn Phe Lys Gly Val Ser Ser Phe Asn Gly Thr 
305 310 315 320 

Ser Phe Ser Asn Ala Ser Tyr Thr Phe Asp Asn Gin Ala Thr Phe Gin 

325 330 335 

Asn Ser Ser Phe Asn Gly Gly Thr Phe Thr Phe Asn Asn Gin Thr Asn 

340 345 350 

Pro Thr Asn Asn Ala Gin His Pro Gin lie Gin Asn Ser Ser Phe Ser 

355 360 365 

Gly Asn Ala Thr Thr Leu Lys Gly Phe Val Asn Phe Gin Gin Ala Phe 

370 375 380 

Asn Asn Ser Asn His Gin Leu Thr lie Gin Asn Ala Ser Phe Asn Asn 
385 390 395 400 

Ala Thr Phe Asn Asn Thr Gly Lys lie Thr lie Glu Lys Asp Ala Ser 

405 410 415 

Phe Asn Asn Thr Thr Phe Asn Thr Ser Val Asp Thr Asn Asn Met Ser 

420 425 430 

Val Thr Gly Gly Val Thr Leu Ser Gly Lys Asn Asp Leu Lys Asn Gly 

435 440 445 

Ser Thr Leu Asp Phe Gly Ser Ser Lys lie Thr Leu Ala Gin Gly Thr 

450 455 460 

Thr Phe Asn Leu Thr Ser Leu Gly Ser Glu Lys Ser Val Thr lie Leu 
465 470 475 480 

Asn Ser Ser Gly Gly lie Thr Tyr Ser Asn Leu Leu Asn His Ala He 

485 490 495 

Asn Gly Leu Thr Ser Ala Leu Lys Thr Asn Glu Ser Leu Ser Asn Pro 

500 505 510 

Gin Ser Phe Ala Gin Gly Leu Trp Asp lie lie Thr Tyr Asn Gly Val 

515 520 525 

Thr Gly Gin Leu Leu Asn Glu Asn Ala Ala Thr Ser Lys Pro Thr Asp 

530 535 540 

Ser Ser Pro Ser Lys Ser Ser Thr Asn Ser Thr Gin Val Tyr Gin Val 
545 550 555 560 

Gly Tyr Lys lie Gly Asp Thr lie Tyr Lys Leu Gin Glu Thr Phe Ser 

565 570 575 

His Asn Ser lie lie lie Gin Ala Leu Glu Ser Gly Thr Tyr Thr Pro 

580 585 590 

Pro Pro Val lie Asn Gly Ser Lys Phe Asp Leu Ser Ala Ser Asn Tyr 

595 600 605 

lie Asn Ala Asp Met Pro Trp Tyr Asp His Lys Tyr Tyr lie Pro Lys 

610 615 620 

Ser Gin Asn Phe Thr Glu Ser Gly Thr Tyr Tyr Leu Pro Ser Val Gin 
625 630 635 640 

lie Trp Gly Ser Tyr Thr Asn Ser Phe Lys Gin Thr Phe Ser Ala Asn 

645 650 655 

Gly Ser Asn Leu Val lie Gly Tyr Asn Ser Thr Trp Thr Asp His Asn 

660 665 670 

Val Ser Ser Ser Gly Thr Val Ser Phe Gly Asp Thr Ser Gly Ser Ala 
675 680 685

Leu Asn Gly His Cys Gly Pro Trp Pro Tyr Tyr Gin Cys Thr Gly Thr 

690 695 700 

Thr Asn Gly Thr Tyr Ser Ala Tyr His Val Tyr lie Thr Ala Asn Leu 
705 710 715 720 

Arg Ser Gly Asn Arg lie Gly Thr Gly Gly Ala Ala Asn Leu He Phe 

725 730 735 

Asn Gly Val Asp Ser He Asn He Ala Asn Ala Thr He Thr Gin His 

740 745 750 

Asn Ala Gly He Tyr Ser Ser Ser Met Thr Phe Ser Thr Gin Ser Met 

755 760 765 

Asp Asn Ser Gin Asn Leu Asn Gly Leu Asn Ser Asn Gly Lys Leu Ser 

770 775 780 

Val Tyr Gly Thr Thr Phe Thr Asn Glu Ala Lys Asp Gly Lys Phe He 
785 790 795 800 

Phe Asn Ala Gly Gin Ala Val Phe Glu Asn Thr Asn Phe Asn Gly Gly 

805 810 815 

Ser Tyr Gin Phe Ser Gly Asp Ser Leu Asn Phe Ser Asn Asn Asn Gin 

820 825 830 

Phe Asn Ser Gly Ser Phe Glu He Ser Ala Lys Asn Ala Ser Phe Asn 

835 840 845 

Asn Ala Asn Phe Asn Asn Ser Ala Ser Phe Asn Phe Asn Asn Ser Asn 

850 855 860 

Ala Thr Thr Ser Phe Val Gly Asp Phe Thr Asn Ala Asn Ser Asn Leu 
865 870 875 880 

Gin He Ala Gly Asn Ala Val Phe Gly Asn Ser Thr Asn Gly Ser Gin 

885 890 895 

Asn Thr Ala Asn Phe Asn Asn Thr Gly Ser Val Asn He Ser Gly Asn 

900 905 910 

Ala Thr Phe Asp Asn Val Val Phe Asn Gly Pro Thr Asn Thr Ser Val 

915 920 925 

Lys Gly Gin Val Thr Leu Asn Asn He Thr Leu Lys Asn Leu Asn Ala 

930 935 940 

Pro Leu Ser Phe Gly Asp Gly Thr He Thr Phe Asn Ala His Ser Val 
945 950 955 960 

He Asn He Ala Glu Ser He Thr Asn Gly Asn Pro He Thr Leu Val 

965 970 975 

Ser Ser Ser Lys Glu He Glu Tyr Asn Asn Ala Phe Ser Lys Asn Leu 

980 985 990 

Trp Gin Leu He Asn Tyr Gin Gly His Gly Ala Ser Ser Glu Lys Leu 

995 1000 1005 

Val Ser Ser Ala Gly Asn Gly Val Tyr Asp Val Val Tyr Ser Phe Asn 

1010 1015 1020 

Asn Gln Thr Tyr Asn Phe Gln Glu Val Phe Ser Gln Asn Ser Ile Ser
1025 1030 1035 1040

He Arg Arg Leu Gly Val Asn Met Val Phe Asp Tyr Val Asp Met Glu 

1045 1050 1055 

Lys Ser Asp His Leu Tyr Tyr Gin Asn Ala Leu Gly Phe Met Thr Tyr 

1060 1065 1070 

Met Pro Asn Ser Tyr Asn Asn Asn Leu Gly Asn Ala Asn Asn Thr He 

1075 1080 1085 

Tyr Tyr Tyr Asp Lys Ser Ile Asp Phe Tyr Ala Ser Gly Lys Thr Leu

1090 1095 1100

Phe Thr Lys Ala Glu Phe Ser Gln Thr Phe Thr Gly Gln Asn Ser Ala
1105 1110 1115 1120
Ile Val Phe Gly Ala Lys Ser Ile Trp Thr Ser Leu Ser Asp Ala Pro

1125 1130 1135 

Gin Ser Asn Thr lie lie Arg Phe Gly Asp Asn Lys Gly Ala Gly Ser 

1140 1145 1150 

Asn Asp Ala Ser Gly His Cys Trp Asn Leu Gin Cys lie Gly Phe lie 

1155 1160 1165 

Thr Gly His Tyr Glu Ala Gin Lys lie Tyr lie Thr Gly Ser lie Glu 

1170 1175 1180 

Ser Gly Asn Arg Ile Ser Ser Gly Gly Gly Ala Ser Leu Asn Phe Asn
1185 1190 1195 1200

Gly Leu Gin Gly lie Leu Leu Thr Asn Ala Thr Leu Tyr Asn Arg Ala 

1205 1210 1215 

Ala Gly Thr Gin Ser Ser Ser Met Asn Phe lie Ser Asn Ser Ala Asn 

1220 1225 1230 

lie Gin Ala Gin Asn Ser Tyr Phe lie Asp Asp Thr Ala Gin Asn Gly 

1235 1240 1245 

Gly Asn Pro Asn Phe Ser Phe Asn Ala Leu Asn Leu Asp Phe Ser Asn 

1250 1255 1260 

Ser Ser Phe Arg Gly Tyr Val Gly Lys Thr Gln Ser Val Phe Lys Phe
1265 1270 1275 1280

Asn Ala Lys Asn Ala lie Ser Phe Thr Asn Ser Thr Asn Leu Ser Ser 

1285 1290 1295 

Gly Leu Tyr Gin Met Gin Ala Lys Ser Val Leu Phe Asp Asn Ser Asn 

1300 1305 1310 

Leu Ser Val Ser Val Gly Thr Ser Ser lie Lys Ala Asn Ala lie Asn 

1315 1320 1325 

Leu Ser Gin Asn Ala Ser lie Asn Ala Ser Asn His Ser Thr Leu Glu 

1330 1335 1340 

Leu Gln Gly Asp Leu Asn Val Asn Asp Thr Ser Ser Leu Asn Leu Asn
1345 1350 1355 1360

Gin Ser Thr lie Asn Val Ser Asn Asn Ala Thr lie Asn Asp Tyr Ala 

1365 1370 1375 

Ser Leu lie Ala Ser Asn Gly Ser His Leu Asn Phe Asn Gly Ala Val 

1380 1385 1390 

Asn Phe Asn Ser Ala Asn He Thr Thr Ser Leu Asn Asn Ser Ser He 

1395 1400 1405 

Val Phe Lys Gly Ala Val Ser Leu Gly Gly Gin Phe Asn Leu Ser Asn 

1410 1415 1420 

Asn Ser Ser Leu Asp Phe Gln Gly Ser Ser Ala Ile Thr Ser Asn Thr
1425 1430 1435 1440

Ala Phe Asn Phe Tyr Asp Asn Ala Phe Ser Gin Ser Pro He Thr Phe 

1445 1450 1455 

His Gin Ala Leu Asp He Lys Ala Pro Leu Ser Leu Gly Gly Asn Leu 

1460 1465 1470 

Leu Asn Pro Asn Asn Ser Ser Val Leu Asp Leu Lys Asn Ser Gin Leu 

1475 1480 1485 

Val Phe Gly Asp Gin Gly Ser Leu Asn He Ala Asn He Asp Leu Leu 

1490 1495 1500 

Ser Asp Leu Asn Asp Asn Lys Asn Arg Val Tyr Asn Ile Ile Gln Ala
1505 1510 1515 1520

Asp Met Asn Ser Asn Trp Tyr Glu Arg He Ser Phe Phe Gly Met His 

1525 1530 1535 

He Asn Asp Gly He Tyr Asp Ala Lys Asn Gin Thr Tyr Ser Phe Thr 

1540 1545 1550 

Asn Pro Leu Asn Asn Ala Leu Lys He Thr Glu Ser Phe Lys Asp Asn 
1555 1560 1565

Gin Leu Ser Val Thr Leu Ser Gin lie Pro Gly He Lys Asn Thr Leu 

1570 1575 1580 

Tyr Asn Ile Gly Ser Glu Ile Phe Asn Tyr Gln Lys Val Tyr Asn Asn
1585 1590 1595 1600

Ala Asn Gly Val Tyr Ser Tyr Ser Asp Asp Ala Gin Gly Val Phe Tyr 

1605 1610 1615 

Leu Thr Ser Asn Val Lys Gly Tyr Tyr Asn Pro Asn Gin Ser Tyr Gin 

1620 1625 1630 

Ala Ser Gly Ser Asn Asn Thr Thr Lys Asn Asn Asn Leu Thr Ser Glu 

1635 1640 1645 

Ser Ser He He Ser Gin Thr Tyr Asn Ala Gin Gly Asn Pro He Ser 

1650 1655 1660 

Ala Leu His Ile Tyr Asn Lys Gly Tyr Asn Phe Asn Asn Ile Lys Ala
1665 1670 1675 1680

Leu Gly Gin Met Ala Leu Lys Leu Tyr Pro Glu He Lys Lys Val Leu 

1685 1690 1695 

Gly Asn Asp Phe Ser Pro Ser Ser Leu Asn Ala Leu Asn Ser Asn Ala 

1700 1705 1710 

Leu Asn Gin Leu Thr Lys Leu He Thr Pro Asn Asp Trp Lys Asn He 

1715 1720 1725 

Asn Glu Leu lie Asp Asn Ala Asn Asn Ser Val Val Gin Asn Phe Asn 

1730 1735 1740 

Asn Gly Thr Leu Ile Val Gly Ala Thr Gln Ile Gly Gln Thr Asp Thr
1745 1750 1755 1760

Asn Ser Ala Val Val Phe Gly Gly Leu Gly Tyr Gin Thr Pro Cys Asp 

1765 1770 1775 

Tyr Thr Asp He Val Cys Gin Lys Phe Arg Gly Thr Tyr Leu Gly Gin 

1780 1785 1790 

Leu Leu Glu Ser Ser Ser Ala Asp Leu Gly Tyr He Asp Thr Thr Phe 

1795 1800 1805 

Asn Ala Lys Glu He Tyr Leu Thr Gly Thr Leu Gly Ser Gly Asn Ala 

1810 1815 1820 

Trp Gly Thr Gly Gly Ser Ala Ser Val Thr Phe Asn Ser Gln Thr Ser
1825 1830 1835 1840

Leu He Leu Asn Gin Ala Asn He Val Ser Ser Gin Thr Asp Gly He 

1845 1850 1855 

Phe Ser Met Leu Gly Gln Glu Gly Ile Asn Lys Val Phe Asn Gln Ala

1860 1865 1870

Gly Leu Ala Asn He Leu Gly Glu Val Ala Val Gin Ser He Asn Lys 

1875 1880 1885 

Ala Gly Gly Leu Gly Asn Leu He Val Asn Thr Leu Gly Ser Asn Ser 

1890 1895 1900 

Val Ile Gly Gly Tyr Leu Thr Pro Glu Gln Lys Asn Gln Thr Leu Ser
1905 1910 1915 1920

Gin Leu Leu Gly Gin Asn Asn Phe Asp Asn Leu Met Asn Asp Ser Gly 

1925 1930 1935 

Leu Asn Thr Ala He Lys Asp Leu He Arg Gin Lys Leu Gly Phe Trp 

1940 1945 1950 

Thr Gly Leu Val Gly Gly Leu Ala Gly Leu Gly Gly Ile Asp Leu Gln

1955 1960 1965

Asn Pro Glu Lys Leu He Gly Ser Met Ser He Asn Asp Leu Leu Ser 

1970 1975 1980 

Lys Lys Gly Leu Phe Asn Gln Ile Thr Gly Phe Ile Ser Ala Asn Asp
1985 1990 1995 2000

Ile Gly Gln Val Ile Ser Val Met Leu Gln Asp Ile Val Lys Pro Ser

2005 2010 2015 

Asn Ala Leu Lys Asn Asp Val Ala Ala Leu Gly Lys Gin Met lie Gly 

2020 2025 2030 

Glu Phe Leu Gly Gin Asp Thr Leu Asn Ser Leu Glu Ser Leu Leu Gin 

2035 2040 2045 

Asn Gin Gin lie Lys Ser Val Leu Asp Lys Val Leu Ala Ala Lys Gly 

2050 2055 2060 

Leu Gly Pro Ile Tyr Glu Gln Gly Leu Gly Asp Leu Ile Pro Asn Leu
2065 2070 2075 2080

Gly Lys Lys Gly Leu Phe Ala Pro Tyr Gly Leu Ser Gin Val Trp Gin 

2085 2090 2095 

Lys Gly Asp Phe Ser Phe Asn Ala Gin Gly Asn Val Phe Val Gin Asn 

2100 2105 2110 

Ser Thr Phe Ser Asn Ala Asn Gly Gly Thr Leu Ser Phe Asn Ala Gly 

2115 2120 2125 

Asn Ser Leu lie Phe Ala Gly Asn Asn His lie Ala Phe Thr Asn His 

2130 2135 2140 

Ala Gly Thr Leu Gln Leu Leu Ser Asp Gln Val Ser Asn Ile Asn Ile
2145 2150 2155 2160

Thr Thr Leu Asn Ala Ser Asn Gly Leu Lys He Asn Ala Ala Asn Asn 

2165 2170 2175 

Asn Val Ser Val Ser Gin Gly Asn Leu Phe Val Ser Ala Ser Cys Ala 

2180 2185 2190 

Gin Gin Ser Asp Pro Thr Thr Ala Asn He Ala Asn Pro Cys Ala Leu 

2195 2200 2205 

Ser Ala Gin Ser Thr Asn Gly Ala Ser Ser Asn Asn Ala Ser Asn Asn 

2210 2215 2220 

Ala Pro Ile Ala Leu Ser Asn Asn Asp Glu Ser Leu Met Val Ala Ala
2225 2230 2235 2240

Asn Asp Phe Asn Phe Ser Gly Asn He Tyr Ala Asn Gly Val Val Asp 

2245 2250 2255 

Phe Ser Lys He Lys Gly Ser Ala Asn He Lys Asn Leu Tyr Leu Tyr 

2260 2265 2270 

Asn Asn Ala Gin Phe Gin Ala Asn Asn Leu Thr He Ser Asn Gin Ala 

2275 2280 2285 

Val Leu Glu Lys Asn Ala Ser Phe Val Thr Asn Asn Leu Asn He Gin 

2290 2295 2300 

Gly Ala Phe Asn Asn Asn Ala Thr Gln Lys Ile Glu Val Leu Gln Asn
2305 2310 2315 2320

Leu Val He Ala Ser Asn Ala Ser Leu Ser Thr Gly He Tyr Gly Leu 

2325 2330 2335 

Glu Val Gly Gly Ala Leu Asn Asn Ser Gly Ala He His Phe Asn Leu 

2340 2345 2350 

Glu Asn Thr Gin Thr Pro Thr Pro Leu He Gin Ala Glu Gly He He 

2355 2360 2365 

Asn Leu Asn Thr Thr Gin Thr Pro Phe Met Asn Val Asn Asn Ser Met 

2370 2375 2380 

Ala Asn Asn Thr Thr Tyr Thr Leu Leu Lys Ser Ser Arg Tyr Ile Asp
2385 2390 2395 2400

Tyr Asn He Asn Pro Asn Ser Leu Gin Ser Tyr Leu Asn Leu Tyr Thr 

2405 2410 2415 

Leu He Asn He Asn Gly Asn His He Glu Glu Lys Asn Gly Ala Leu 

2420 2425 2430 

Thr Tyr Leu Gly Gin Arg Val Leu Leu Gin Asp Lys Gly Leu Leu Leu 
2435 2440 2445

Ser Val Ala Leu Pro Asn Ser Asn Asn Ala Ser Gin Asn Asn lie Leu 

2450 2455 2460 

Ser Leu Ser Val Leu Tyr Asn Gln Val Lys Met Ser Cys Gly Asp Lys
2465 2470 2475 2480

Ala Met Asp Phe Thr Pro Pro Thr Leu Gin Asp Tyr lie Val Gly lie 

2485 2490 2495 

Gin Gly Gin Ser Ala Leu Asn Gin lie Glu Ala Val Gly Gly Asn Ala 

2500 2505 2510 

lie Lys Trp Leu Ser Thr Leu Met Met Glu Thr Lys Glu Asn Pro Phe 

2515 2520 2525 

Phe Ala Pro lie Tyr Leu Lys Asn His Ser Leu Asn Glu lie Leu Gly 

2530 2535 2540 

Val Thr Lys Asp Leu Gln Asn Thr Ala Ser Leu Ile Ser Asn Pro Asn
2545 2550 2555 2560

Phe Arg Asp Asn Ala Thr Asn Leu Leu Glu Leu Ala Ser Tyr Thr Gin 

2565 2570 2575 

Gin Thr Ser Arg Leu Thr Lys Leu Ser Asp Phe Arg Ser Arg Glu Gly 

2580 2585 2590 

Glu Ser Asp Phe Ser Leu Leu Glu Leu Lys Asn Lys Arg Phe Ser Asp 

2595 2600 2605 

Pro Asn Pro Glu Val Phe Val Lys Tyr Ser Gin Leu Ser Lys His Pro 

2610 2615 2620 

Asn Asn Leu Trp Val Gln Gly Val Gly Gly Ala Ser Phe Ile Ser Gly
2625 2630 2635 2640

Gly Asn Gly Thr Leu Tyr Gly Leu Asn Ala Gly Tyr Asp Arg Leu Val 

2645 2650 2655 

Lys Asn Val He Leu Gly Gly Tyr Val Ala Tyr Gly Tyr Ser Asp Phe 

2660 2665 2670 

Asn Gly Asn He Met His Ser Leu Gly Asn Asn Val Asp Val Gly Met 

2675 2680 2685 

Tyr Ala Arg Ala Phe Leu Lys Arg Asn Glu Phe Thr Leu Ser Ala Asn 

2690 2695 2700 

Glu Thr Tyr Gly Gly Asn Ala Thr Ser Ile Asn Ser Ser Asn Ser Leu
2705 2710 2715 2720

Leu Ser Val Leu Asn Gin Arg Tyr Asn Tyr Asn Thr Trp Thr Thr Ser 

2725 2730 2735 

Val Asn Gly Asn Tyr Gly Tyr Asp Phe Met Phe Lys Gin Lys Ser Val 

2740 2745 2750 

Val Leu Lys Pro Gin Val Gly Leu Ser Tyr His Phe He Gly Leu Ser 

2755 2760 2765 

Gly Met Lys Gly Asn Asp Ala Ala Tyr Lys Gin Phe Leu Met His Ser 

2770 2775 2780 

Asn Pro Ser Asn Glu Ser Val Leu Thr Leu Asn Met Gly Leu Glu Ser 
2785 2790 2795 2800

Arg Lys Tyr Phe Gly Lys Asn Ser Tyr Tyr Phe Val Thr Ala Arg Leu 

2805 2810 2815 

Gly Arg Asp Leu Leu He Lys Ser Lys Gly Ser Asn Thr Val Arg Phe 

2820 2825 2830 

Val Gly Glu Asn Thr Leu Leu Tyr Arg Lys Gly Glu Val Phe Asn Thr 

2835 2840 2845 

Phe Ala Ser Val He Thr Gly Gly Glu Met His Leu Trp Arg Leu Val 

2850 2855 2860 

Tyr Val Asn Ala Gly Val Gly Leu Lys Met Gly Leu Gln Tyr Gln Asp
2865 2870 2875 2880

Ile Asn Ile Thr Gly Asn Val Gly Met Arg Val Ala Phe
2885 2890 



(2) INFORMATION FOR SEQ ID NO: 167:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1376 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 13...1338
(D) OTHER INFORMATION:



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

TGGTAGTTAA GA ATG GGT AAT CAT TTT TCT AAA TTA GGA TTT GTT TTA GCC 51
Met Gly Asn His Phe Ser Lys Leu Gly Phe Val Leu Ala
1 5 10

GCA TTA GGA AGC GCG ATA GGT TTA GGG CAT ATC TGG CGT TTC CCC TAC 99
Ala Leu Gly Ser Ala Ile Gly Leu Gly His Ile Trp Arg Phe Pro Tyr
15 20 25

ATG ACT GGG GTG AGT GGT GGG GGT GCT TTT GTT TTA TTG TTT TTA TTT 147
Met Thr Gly Val Ser Gly Gly Gly Ala Phe Val Leu Leu Phe Leu Phe
30 35 40 45

TTA TCT TTA AGC GTT GGC GCG GCG ATG TTT ATC GCT GAA ATG CTA TTA 195
Leu Ser Leu Ser Val Gly Ala Ala Met Phe Ile Ala Glu Met Leu Leu
50 55 60

GGA CAA AGC ACT CAA AAA AAT GTA ACA GAA GCT TTT AAA GAG CTT GAC 243
Gly Gln Ser Thr Gln Lys Asn Val Thr Glu Ala Phe Lys Glu Leu Asp
65 70 75

ATT AAC CCC AAA AAA CGC TGG AAA TAC GCA GGG CTT TTG CTT GTT TCT 291
Ile Asn Pro Lys Lys Arg Trp Lys Tyr Ala Gly Leu Leu Leu Val Ser
80 85 90

GGG CCA TTA ATA CTG ACT TTT TAC GGC ACG ATT TTA GGT TGG GTG CTT 339
Gly Pro Leu Ile Leu Thr Phe Tyr Gly Thr Ile Leu Gly Trp Val Leu
95 100 105

TAT TAT TTG GTG AGT GTT AGT TTT AAT TTG CCT AAC AAT ATC CAA GAA 387
Tyr Tyr Leu Val Ser Val Ser Phe Asn Leu Pro Asn Asn Ile Gln Glu
110 115 120 125
TCT GAA CAA ATT TTT ACT CAA ACT TTG CAG TCT ATA GGG CTA CAA TCC 435
Ser Glu Gln Ile Phe Thr Gln Thr Leu Gln Ser Ile Gly Leu Gln Ser
130 135 140

ATA GGG CTT TTT AGC GTT TTA TTG ATA ACC GGA TGG ATT GTT TCT AGG 483
Ile Gly Leu Phe Ser Val Leu Leu Ile Thr Gly Trp Ile Val Ser Arg
145 150 155

GGG ATT AAA GAA GGC ATT GAA AAG CTC AAT TTG GTT TTA ATG CCC TTA 531
Gly Ile Lys Glu Gly Ile Glu Lys Leu Asn Leu Val Leu Met Pro Leu
160 165 170

CTC TTT GCT ACT TTT TTT GGT TTG CTT TTC TAT GCG ATG AGC ATG GAT 579
Leu Phe Ala Thr Phe Phe Gly Leu Leu Phe Tyr Ala Met Ser Met Asp
175 180 185

TCT TTT TCT AAA GCT TTT CAT TTC ATG TTT GAT TTC AAA CCA AAA GAT 627
Ser Phe Ser Lys Ala Phe His Phe Met Phe Asp Phe Lys Pro Lys Asp
190 195 200 205

TTG ACC TCT CAA GTG TTC ACT TAT TCC TTG GGG CAG GTT TTC TTT TCC 675
Leu Thr Ser Gln Val Phe Thr Tyr Ser Leu Gly Gln Val Phe Phe Ser
210 215 220

TTA AGC ATC GGT TTA GGG ATC AAT ATC ACT TAC GCT GCG GTT ACG GAT 723
Leu Ser Ile Gly Leu Gly Ile Asn Ile Thr Tyr Ala Ala Val Thr Asp
225 230 235

AAA ACG CAG AAT TTG CTT AAA AGC ACT ATT TGG GTG GTT TTA TCA GGA 771
Lys Thr Gln Asn Leu Leu Lys Ser Thr Ile Trp Val Val Leu Ser Gly
240 245 250

ATT CTA ATT TCT CTT GTG GCA GGA CTT ATG ATT TTC ACT TTT GTG TTT 819
Ile Leu Ile Ser Leu Val Ala Gly Leu Met Ile Phe Thr Phe Val Phe
255 260 265

GAA TAT GGG GCG AAT GTC TCA CAA GGC ACA GGG TTA ATC TTC ACT TCT 867
Glu Tyr Gly Ala Asn Val Ser Gln Gly Thr Gly Leu Ile Phe Thr Ser
270 275 280 285

TTA CCG GTG GTT TTT GGC CAA ATG GGA GCG ATA GGC ATT CTT GTT TCG 915
Leu Pro Val Val Phe Gly Gln Met Gly Ala Ile Gly Ile Leu Val Ser
290 295 300

ATT CTT TTC TTG CTC GCG CTC GCT TTT GCT GGC ATC ACT TCT ACG GTG 963
Ile Leu Phe Leu Leu Ala Leu Ala Phe Ala Gly Ile Thr Ser Thr Val
305 310 315

GCT TTA TTG GAG CCA AGC GTG ATG TAT CTT ACC GAA AGG TAT CAA TAC 1011
Ala Leu Leu Glu Pro Ser Val Met Tyr Leu Thr Glu Arg Tyr Gln Tyr
320 325 330

TCT CGT TTT AAG GTT ACT TGG GGT CTT GTA GCA CTA ATT TTT GTG GTA 1059
Ser Arg Phe Lys Val Thr Trp Gly Leu Val Ala Leu Ile Phe Val Val
335 340 345
GGC GTG GTG TTG ATT TTC TCG CTC CAT AAG GAT TAT AAA GAT TAT CTC 1107
Gly Val Val Leu Ile Phe Ser Leu His Lys Asp Tyr Lys Asp Tyr Leu
350 355 360 365

ACT TTC TTT GAA AAA AGT CTT TTT GAT TGG TTG GAT TTT GCA TCA AGC 1155
Thr Phe Phe Glu Lys Ser Leu Phe Asp Trp Leu Asp Phe Ala Ser Ser
370 375 380

ACC ATT ATC ATG CCT TTA GGC GGG ATG GCA ACC TTT ATT TTT ATG GGT 1203
Thr Ile Ile Met Pro Leu Gly Gly Met Ala Thr Phe Ile Phe Met Gly
385 390 395

TGG GTT TTG AAA AAA GAA AAA TTG CGT CTT TTG AGC GTG CAC TTT TTA 1251
Trp Val Leu Lys Lys Glu Lys Leu Arg Leu Leu Ser Val His Phe Leu
400 405 410

GGC CCT AAA TTG TTT GCA ACT TGG TAT TTC TTG CTT AAA TAT ATC ACC 1299
Gly Pro Lys Leu Phe Ala Thr Trp Tyr Phe Leu Leu Lys Tyr Ile Thr
415 420 425

CCT TTA ATT GTG TTT TCC ATT TGG TTG AGC AAG ATT TAT TAAAATATTT GG 1350
Pro Leu Ile Val Phe Ser Ile Trp Leu Ser Lys Ile Tyr
430 435 440

CATGGGAAAA TTTTCTAAAT TAGGCT 1376

(2) INFORMATION FOR SEQ ID NO: 168:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 442 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

Met Gly Asn His Phe Ser Lys Leu Gly Phe Val Leu Ala Ala Leu Gly
1 5 10 15

Ser Ala Ile Gly Leu Gly His Ile Trp Arg Phe Pro Tyr Met Thr Gly
20 25 30

Val Ser Gly Gly Gly Ala Phe Val Leu Leu Phe Leu Phe Leu Ser Leu
35 40 45

Ser Val Gly Ala Ala Met Phe Ile Ala Glu Met Leu Leu Gly Gln Ser
50 55 60

Thr Gln Lys Asn Val Thr Glu Ala Phe Lys Glu Leu Asp Ile Asn Pro
65 70 75 80

Lys Lys Arg Trp Lys Tyr Ala Gly Leu Leu Leu Val Ser Gly Pro Leu
85 90 95

Ile Leu Thr Phe Tyr Gly Thr Ile Leu Gly Trp Val Leu Tyr Tyr Leu
100 105 110

Val Ser Val Ser Phe Asn Leu Pro Asn Asn Ile Gln Glu Ser Glu Gln
115 120 125

Ile Phe Thr Gln Thr Leu Gln Ser Ile Gly Leu Gln Ser Ile Gly Leu
130 135 140

(2) INFORMATION FOR SEQ ID NO: 169:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1392 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(ix) FEATURE:

(A) NAME/KEY: Coding Sequence

(B) LOCATION: 22...1356

(D) OTHER INFORMATION:

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:

TTTAAAAGGT ATTTTATAAC G ATG AAA ATT TTT GGG ACT GAT GGC GTG AGG 51
Met Lys Ile Phe Gly Thr Asp Gly Val Arg
1 5 10

GGT AAA GCA GGG GTG AAA CTC ACC CCC ATG TTT GTG ATG CGT TTA GGC 99
Gly Lys Ala Gly Val Lys Leu Thr Pro Met Phe Val Met Arg Leu Gly
15 20 25

ATT GCT GCC GGA TTG TAT TTT AAA AAA CAT TCT CAA ACG AAT AAA ATT 147
Ile Ala Ala Gly Leu Tyr Phe Lys Lys His Ser Gln Thr Asn Lys Ile
30 35 40

CTA ATC GGT AAA GAC ACC AGA AAA AGC GGC TAT ATG GTA GAA AAC GCT 195
Leu Ile Gly Lys Asp Thr Arg Lys Ser Gly Tyr Met Val Glu Asn Ala
45 50 55

TTA GTG AGC GCT CTA ACT TCC ATA GGC TAT AAT GTG ATT CAA ATA GGG 243
Leu Val Ser Ala Leu Thr Ser Ile Gly Tyr Asn Val Ile Gln Ile Gly
60 65 70

CCT ATG CCC ACC CCT GCG ATT GCG TTT TTA ACT GAA GAC ATG CGC TGT 291
Pro Met Pro Thr Pro Ala Ile Ala Phe Leu Thr Glu Asp Met Arg Cys
75 80 85 90

GAT GCG GGT ATT ATG ATA AGC GCG AGC CAC AAC CCT TTT GAA GAT AAT 339
Asp Ala Gly Ile Met Ile Ser Ala Ser His Asn Pro Phe Glu Asp Asn
95 100 105

GGC ATT AAG TTT TTC AAT TCT TAT GGC TAT AAG CTT AAA GAA GAA GAA 387
Gly Ile Lys Phe Phe Asn Ser Tyr Gly Tyr Lys Leu Lys Glu Glu Glu
110 115 120

GAA AAA GCG ATT GAA GAA ATC TTT CAT GAT GAA GAA TTA CTG CAT TCT 435
Glu Lys Ala Ile Glu Glu Ile Phe His Asp Glu Glu Leu Leu His Ser
125 130 135

AGC TAT AAA GTG GGT GAG AGC GTC GGT AGC GCT AAA AGG ATA GAC GAT 483
Ser Tyr Lys Val Gly Glu Ser Val Gly Ser Ala Lys Arg Ile Asp Asp
140 145 150

GTC ATA GGG CGC TAT ATT GCA CAT TTA AAA CAC TCT TTC CCC AAA CAT 531
Val Ile Gly Arg Tyr Ile Ala His Leu Lys His Ser Phe Pro Lys His
155 160 165 170

TTG AAT TTA CAG AGT TTA AGG ATC GTG CTA GAT ACG GCT AAT GGC GCG 579
Leu Asn Leu Gln Ser Leu Arg Ile Val Leu Asp Thr Ala Asn Gly Ala
175 180 185

GCT TAT AAG GTG GCT CCG GTC GTT TTT AGC GAG CTT GGG GCT GAT GTG 627
Ala Tyr Lys Val Ala Pro Val Val Phe Ser Glu Leu Gly Ala Asp Val
190 195 200

TTA GTG ATT AAT GAT GAG CCT AAC GGG TGT AAC ATT AAT GAT CAA TGC 675
Leu Val Ile Asn Asp Glu Pro Asn Gly Cys Asn Ile Asn Asp Gln Cys
205 210 215

GGG GCT TTA CAC CCC AAC CAA TTA AGC CAG GAA GTG AAA AAA TAC CGC 723
Gly Ala Leu His Pro Asn Gln Leu Ser Gln Glu Val Lys Lys Tyr Arg
220 225 230

GCA GAT TTA GGC TTT GCT TTT GAT GGC GAT GCT GAC AGG CTA GTG GTG 771
Ala Asp Leu Gly Phe Ala Phe Asp Gly Asp Ala Asp Arg Leu Val Val
235 240 245 250

GTG GAT AAT TTA GGG AAT ATC GTG CAT GGG GAT AAG CTT TTA GGG GTG 819
Val Asp Asn Leu Gly Asn Ile Val His Gly Asp Lys Leu Leu Gly Val
255 260 265

TTA GGG GTT TAT CAA AAA TCT AAA AAC GCC CTT TCT TCT CAA GCG GTT 867
Leu Gly Val Tyr Gln Lys Ser Lys Asn Ala Leu Ser Ser Gln Ala Val
270 275 280

GTC GCC ACA AAC ATG AGC AAT TTA GCC CTT AAA GAA TAT TTA AAA TCC 915
Val Ala Thr Asn Met Ser Asn Leu Ala Leu Lys Glu Tyr Leu Lys Ser
285 290 295

CAA GAT TTG GAA TTG AAG CAT TGC GCG ATT GGG GAT AAG TTT GTG AGC 963
Gln Asp Leu Glu Leu Lys His Cys Ala Ile Gly Asp Lys Phe Val Ser
300 305 310

GAA TGC ATG CAA TTG AAT AAA GCC AAT TTT GGA GGC GAG CAA AGC GGG 1011
Glu Cys Met Gln Leu Asn Lys Ala Asn Phe Gly Gly Glu Gln Ser Gly
315 320 325 330

CAT ATC ATT TTT AGC GAT TAC GCT AAA ACA GGC GAT GGT TTG GTG TGC 1059
His Ile Ile Phe Ser Asp Tyr Ala Lys Thr Gly Asp Gly Leu Val Cys
335 340 345

GCT TTG CAA GTG AGC GCG TTA GTG TTA GAA AGC AAG CAG GTA AGC TCT 1107
Ala Leu Gln Val Ser Ala Leu Val Leu Glu Ser Lys Gln Val Ser Ser
350 355 360

GTT GCG TTA AAC CCC TTT GAA TTA TAC CCC CAA AGC CTA GTG AAT TTG 1155
Val Ala Leu Asn Pro Phe Glu Leu Tyr Pro Gln Ser Leu Val Asn Leu
365 370 375

AAT GTC CAA AAA AAG CCC CCT TTA GAA AGC CTG AAA GGT TAT AGC GCT 1203
Asn Val Gln Lys Lys Pro Pro Leu Glu Ser Leu Lys Gly Tyr Ser Ala
380 385 390

CTT TTA AAA GAA TTA GAC AAG CTA GAA ATC CGC CAT TTG ATC CGT TAT 1251
Leu Leu Lys Glu Leu Asp Lys Leu Glu Ile Arg His Leu Ile Arg Tyr
395 400 405 410

AGC GGC ACT GAA AAC AAA TTG CGA ATC CTT TTA GAA GCT AAA GAT GAA 1299
Ser Gly Thr Glu Asn Lys Leu Arg Ile Leu Leu Glu Ala Lys Asp Glu
415 420 425

AAG CTT TTA GAA TCC AAA ATG CAA GAA TTA AAA GAG TTT TTT GAA GGG 1347
Lys Leu Leu Glu Ser Lys Met Gln Glu Leu Lys Glu Phe Phe Glu Gly
430 435 440

CAT TTG TGC TAAAAACCAC TAAAAAAAGC CTGTTGGTTT TTATGG 1392
His Leu Cys
445



(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 445 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

Met Lys Ile Phe Gly Thr Asp Gly Val Arg Gly Lys Ala Gly Val Lys
1 5 10 15

Leu Thr Pro Met Phe Val Met Arg Leu Gly Ile Ala Ala Gly Leu Tyr
20 25 30

Phe Lys Lys His Ser Gln Thr Asn Lys Ile Leu Ile Gly Lys Asp Thr
35 40 45

Arg Lys Ser Gly Tyr Met Val Glu Asn Ala Leu Val Ser Ala Leu Thr
50 55 60

Ser Ile Gly Tyr Asn Val Ile Gln Ile Gly Pro Met Pro Thr Pro Ala
65 70 75 80

Ile Ala Phe Leu Thr Glu Asp Met Arg Cys Asp Ala Gly Ile Met Ile
85 90 95

Ser Ala Ser His Asn Pro Phe Glu Asp Asn Gly Ile Lys Phe Phe Asn
100 105 110

Ser Tyr Gly Tyr Lys Leu Lys Glu Glu Glu Glu Lys Ala Ile Glu Glu
115 120 125

Ile Phe His Asp Glu Glu Leu Leu His Ser Ser Tyr Lys Val Gly Glu
130 135 140

Ser Val Gly Ser Ala Lys Arg Ile Asp Asp Val Ile Gly Arg Tyr Ile
145 150 155 160

Ala His Leu Lys His Ser Phe Pro Lys His Leu Asn Leu Gln Ser Leu
165 170 175

Arg Ile Val Leu Asp Thr Ala Asn Gly Ala Ala Tyr Lys Val Ala Pro
180 185 190

Val Val Phe Ser Glu Leu Gly Ala Asp Val Leu Val Ile Asn Asp Glu
195 200 205

Pro Asn Gly Cys Asn Ile Asn Asp Gln Cys Gly Ala Leu His Pro Asn
210 215 220

Gln Leu Ser Gln Glu Val Lys Lys Tyr Arg Ala Asp Leu Gly Phe Ala
225 230 235 240

Phe Asp Gly Asp Ala Asp Arg Leu Val Val Val Asp Asn Leu Gly Asn
245 250 255

Ile Val His Gly Asp Lys Leu Leu Gly Val Leu Gly Val Tyr Gln Lys
260 265 270

Ser Lys Asn Ala Leu Ser Ser Gln Ala Val Val Ala Thr Asn Met Ser
275 280 285

Asn Leu Ala Leu Lys Glu Tyr Leu Lys Ser Gln Asp Leu Glu Leu Lys
290 295 300

His Cys Ala Ile Gly Asp Lys Phe Val Ser Glu Cys Met Gln Leu Asn
305 310 315 320

Lys Ala Asn Phe Gly Gly Glu Gln Ser Gly His Ile Ile Phe Ser Asp
325 330 335

Tyr Ala Lys Thr Gly Asp Gly Leu Val Cys Ala Leu Gln Val Ser Ala
340 345 350

Leu Val Leu Glu Ser Lys Gln Val Ser Ser Val Ala Leu Asn Pro Phe
355 360 365

Glu Leu Tyr Pro Gln Ser Leu Val Asn Leu Asn Val Gln Lys Lys Pro
370 375 380

Pro Leu Glu Ser Leu Lys Gly Tyr Ser Ala Leu Leu Lys Glu Leu Asp
385 390 395 400

Lys Leu Glu Ile Arg His Leu Ile Arg Tyr Ser Gly Thr Glu Asn Lys
405 410 415

Leu Arg Ile Leu Leu Glu Ala Lys Asp Glu Lys Leu Leu Glu Ser Lys
420 425 430

Met Gln Glu Leu Lys Glu Phe Phe Glu Gly His Leu Cys
435 440 445



(2) INFORMATION FOR SEQ ID NO: 171:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171:

GCCGGATCCA TGACTTATGG GTATGGGGAA 30

(2) INFORMATION FOR SEQ ID NO: 172:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

GCCCTCGAGA CTTTTATTGA TTCACCATTT CATT 34

(2) INFORMATION FOR SEQ ID NO: 173:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173:

GCCGGATCCA TCGCTGAAGA AAATGGGGCG 30

(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:

GCCCGGCCGC CCTAAAAACT ATAAACATAA CTC 33

(2) INFORMATION FOR SEQ ID NO: 175:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:

GCCGGATCCG GTATTAGGAA GCTTATACCA TC 32

(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:

GCCCTCGAGA AGTTCTATTT TTAATTCCTT GAGAG 35

(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:

GCCGGATCCT CTGATAGCCA TAAAGAAAAA AAGGAC 36

(2) INFORMATION FOR SEQ ID NO: 178:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:

GCCCTCGAGA TCTTTAGAAA TCAACCCCCA AAGC 34

(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:

GCCGGATCCG ACTTAGAACA TTTTAACACG CTC 33

(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:

GCCCTCGAGT CATTTTAAAC GACTCAAAAC AAA 33

(2) INFORMATION FOR SEQ ID NO: 181:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:

GCCGGATCCG GCCAAAGCGT GCGCACTTAT TGG 33

(2) INFORMATION FOR SEQ ID NO: 182:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182:

GCCCTCGAGT TATTGTTCCA ACCCCCACGC ATC 33

(2) INFORMATION FOR SEQ ID NO: 183:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:

GCCGGATCCA AGAGCAATGC TGATGACAAA CC 32

(2) INFORMATION FOR SEQ ID NO: 184:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

GCCCTCGAGT TATGAGTTAA AGCCCCTTGT CC 32

(2) INFORMATION FOR SEQ ID NO: 185:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:

GCCGGATCCG AATCAGTAAA AACAGGAAAA AC 32

(2) INFORMATION FOR SEQ ID NO: 186:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:

GCCCTCGAGC GGCTCTTTGG AGTTTTATTG 30

(2) INFORMATION FOR SEQ ID NO: 187:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:

GCCGGATCCA TCATTCCCTC TCGCTCTATG G 31

(2) INFORMATION FOR SEQ ID NO: 188:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188:

GCCCTCGAGA CCTTAATGCG TTGCGTTTTC TTT 33

(2) INFORMATION FOR SEQ ID NO: 189:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:

GCCGAGCTCC AAGCAAAAAA ATGTCAATTA AAAGGG 36

(2) INFORMATION FOR SEQ ID NO: 190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

GCCCTCGAGG TCTAAATTAG AATAAGTGTT GTT 33

1. An isolated polynucleotide that encodes:

(i) a polypeptide comprising an amino acid sequence that is homologous to
the amino acid sequence of a Helicobacter polypeptide, wherein said amino acid
sequence of said Helicobacter polypeptide is selected from the group consisting of
the amino acid sequences as shown in SEQ ID NO:2 (GHPO 13), SEQ ID NO:4
(GHPO 73), SEQ ID NO:6 (GHPO 90), SEQ ID NO:8 (GHPO 107), SEQ ID NO:10
(GHPO 136), SEQ ID NO:12 (GHPO 191), SEQ ID NO:14 (GHPO 213), SEQ ID
NO:16 (GHPO 240), SEQ ID NO:18 (GHPO 408), SEQ ID NO:20 (GHPO 411),
SEQ ID NO:22 (GHPO 419), SEQ ID NO:24 (GHPO 431), SEQ ID NO:26 (GHPO
474), SEQ ID NO:28 (GHPO 591), SEQ ID NO:30 (GHPO 596), SEQ ID NO:32
(GHPO 699), SEQ ID NO:34 (GHPO 724), SEQ ID NO:36 (GHPO 730), SEQ ID
NO:38 (GHPO 761), SEQ ID NO:40 (GHPO 804), SEQ ID NO:42 (GHPO 805),
SEQ ID NO:44 (GHPO 812), SEQ ID NO:46 (GHPO 879), SEQ ID NO:48 (GHPO
888), SEQ ID NO:50 (GHPO 986), SEQ ID NO:52 (GHPO 1056), SEQ ID NO:54
(GHPO 1081), SEQ ID NO:56 (GHPO 1100), SEQ ID NO:58 (GHPO 1140), SEQ
ID NO:60 (GHPO 1148), SEQ ID NO:62 (GHPO 1200), SEQ ID NO:64 (GHPO
1212), SEQ ID NO:66 (GHPO 1258), SEQ ID NO:68 (GHPO 1263), SEQ ID NO:70
(GHPO 1273), SEQ ID NO:72 (GHPO 1284), SEQ ID NO:74 (GHPO 1299), SEQ
ID NO:76 (GHPO 1327), SEQ ID NO:78 (GHPO 1346), SEQ ID NO:80 (GHPO
1378), SEQ ID NO:82 (GHPO 1412), SEQ ID NO:84 (GHPO 1443), SEQ ID NO:86
(GHPO 1466), SEQ ID NO:88 (GHPO 1476), SEQ ID NO:90 (GHPO 1536), SEQ
ID NO:92 (GHPO 1559), SEQ ID NO:94 (GHPO 427), SEQ ID NO:96 (GHPO
1045), SEQ ID NO:98 (GHPO 1262), SEQ ID NO:100 (GHPO 1688), SEQ ID
NO:102 (GHPO 1538), SEQ ID NO:104 (GHPO 346), SEQ ID NO:106 (GHPO
1012), SEQ ID NO:108 (GHPO 470), SEQ ID NO:110 (GHPO 1398), SEQ ID
NO:112 (GHPO 1550), SEQ ID NO:114 (GHPO 276), SEQ ID NO:116 (GHPO
1501), SEQ ID NO:118 (GHPO 706), SEQ ID NO:120 (GHPO 1001), SEQ ID
NO:122 (GHPO 732), SEQ ID NO:124 (GHPO 329), SEQ ID NO:126 (GHPO 574),
SEQ ID NO:128 (GHPO 1190), SEQ ID NO:130 (GHPO 1374), SEQ ID NO:132
(GHPO 1620), SEQ ID NO:134 (GHPO 956), SEQ ID NO:136 (HPO 98), SEQ ID
NO:138 (GHPO 689), SEQ ID NO:140 (GHPO 208), SEQ ID NO:142 (GHPO 296),
SEQ ID NO:144 (GHPO 726), SEQ ID NO:146 (GHPO 1026), SEQ ID NO:148
(GHPO 1301), SEQ ID NO:150 (GHPO 1536), SEQ ID NO:152 (GHPO 166), SEQ
ID NO:154 (GHPO 253), SEQ ID NO:156 (GHPO 297), SEQ ID NO:158 (GHPO
615), SEQ ID NO:160 (GHPO 1278), SEQ ID NO:162 (GHPO 1282), SEQ ID
NO:164 (GHPO 1420), SEQ ID NO:166 (GHPO 1484), SEQ ID NO:168 (GHPO
1719), and SEQ ID NO:170 (GHPO 1252); or

(ii) a derivative of said polypeptide encoded by said polynucleotide. 

2. The isolated polynucleotide of claim 1, which encodes a mature form of 
said polypeptide. 

3. The isolated polynucleotide of claim 1 or 2, wherein the polynucleotide is 
a DNA molecule. 

4. A compound, in a substantially purified form, that is the mature form or a
derivative of a polypeptide comprising an amino acid sequence that is homologous to
a Helicobacter amino acid sequence that is selected from the group consisting of the
amino acid sequences as shown in SEQ ID NO:2 (GHPO 13), SEQ ID NO:4 (GHPO
73), SEQ ID NO:6 (GHPO 90), SEQ ID NO:8 (GHPO 107), SEQ ID NO:10 (GHPO
136), SEQ ID NO:12 (GHPO 191), SEQ ID NO:14 (GHPO 213), SEQ ID NO:16
(GHPO 240), SEQ ID NO:18 (GHPO 408), SEQ ID NO:20 (GHPO 411), SEQ ID
NO:22 (GHPO 419), SEQ ID NO:24 (GHPO 431), SEQ ID NO:26 (GHPO 474),
SEQ ID NO:28 (GHPO 591), SEQ ID NO:30 (GHPO 596), SEQ ID NO:32 (GHPO
699), SEQ ID NO:34 (GHPO 724), SEQ ID NO:36 (GHPO 730), SEQ ID NO:38
(GHPO 761), SEQ ID NO:40 (GHPO 804), SEQ ID NO:42 (GHPO 805), SEQ ID
NO:44 (GHPO 812), SEQ ID NO:46 (GHPO 879), SEQ ID NO:48 (GHPO 888),
SEQ ID NO:50 (GHPO 986), SEQ ID NO:52 (GHPO 1056), SEQ ID NO:54 (GHPO
1081), SEQ ID NO:56 (GHPO 1100), SEQ ID NO:58 (GHPO 1140), SEQ ID NO:60
(GHPO 1148), SEQ ID NO:62 (GHPO 1200), SEQ ID NO:64 (GHPO 1212), SEQ
ID NO:66 (GHPO 1258), SEQ ID NO:68 (GHPO 1263), SEQ ID NO:70 (GHPO
1273), SEQ ID NO:72 (GHPO 1284), SEQ ID NO:74 (GHPO 1299), SEQ ID NO:76
(GHPO 1327), SEQ ID NO:78 (GHPO 1346), SEQ ID NO:80 (GHPO 1378), SEQ
ID NO:82 (GHPO 1412), SEQ ID NO:84 (GHPO 1443), SEQ ID NO:86 (GHPO
1466), SEQ ID NO:88 (GHPO 1476), SEQ ID NO:90 (GHPO 1536), SEQ ID NO:92
(GHPO 1559), SEQ ID NO:94 (GHPO 427), SEQ ID NO:96 (GHPO 1045), SEQ ID
NO:98 (GHPO 1262), SEQ ID NO:100 (GHPO 1688), SEQ ID NO:102 (GHPO
1538), SEQ ID NO:104 (GHPO 346), SEQ ID NO:106 (GHPO 1012), SEQ ID
NO:108 (GHPO 470), SEQ ID NO:110 (GHPO 1398), SEQ ID NO:112 (GHPO
1550), SEQ ID NO:114 (GHPO 276), SEQ ID NO:116 (GHPO 1501), SEQ ID
NO:118 (GHPO 706), SEQ ID NO:120 (GHPO 1001), SEQ ID NO:122 (GHPO
732), SEQ ID NO:124 (GHPO 329), SEQ ID NO:126 (GHPO 574), SEQ ID NO:128
(GHPO 1190), SEQ ID NO:130 (GHPO 1374), SEQ ID NO:132 (GHPO 1620), SEQ
ID NO:134 (GHPO 956), SEQ ID NO:136 (HPO 98), SEQ ID NO:138 (GHPO 689),
SEQ ID NO:140 (GHPO 208), SEQ ID NO:142 (GHPO 296), SEQ ID NO:144
(GHPO 726), SEQ ID NO:146 (GHPO 1026), SEQ ID NO:148 (GHPO 1301), SEQ
ID NO:150 (GHPO 1536), SEQ ID NO:152 (GHPO 166), SEQ ID NO:154 (GHPO
253), SEQ ID NO:156 (GHPO 297), SEQ ID NO:158 (GHPO 615), SEQ ID NO:160
(GHPO 1278), SEQ ID NO:162 (GHPO 1282), SEQ ID NO:164 (GHPO 1420), SEQ
ID NO:166 (GHPO 1484), SEQ ID NO:168 (GHPO 1719), and SEQ ID NO:170
(GHPO 1252); or

(ii) a derivative of said polypeptide.

5. A method of preventing or treating Helicobacter infection in a mammal,
said method comprising administering to said mammal a prophylactically or
therapeutically effective amount of a compound of claim 4.

6. The method of claim 5, further comprising administering to said mammal
an antibiotic, an antisecretory agent, a bismuth salt, or a combination thereof.

7. The method of claim 6, wherein said antibiotic is selected from the group
consisting of amoxicillin, clarithromycin, tetracycline, metronidazole, and
erythromycin, and said bismuth salt is selected from the group consisting of bismuth
subcitrate and bismuth subsalicylate.

8. The method of claim 6, wherein said antisecretory agent is a proton pump
inhibitor, an H2-receptor antagonist, or a prostaglandin analog.

9. The method of claim 8, wherein said proton pump inhibitor is selected
from the group consisting of omeprazole, lansoprazole, and pantoprazole; said
H2-receptor antagonist is selected from the group consisting of ranitidine, cimetidine,
famotidine, nizatidine, and roxatidine; and said prostaglandin analog is selected from
the group consisting of misoprostol and enprostil.

10. The method of claim 5, further comprising administering to said mammal
a prophylactically or therapeutically effective amount of a second Helicobacter
polypeptide or a derivative thereof.

11. The method of claim 10, wherein the second Helicobacter polypeptide is
a Helicobacter urease, a subunit, or a derivative thereof.

12. A composition comprising a compound of claim 4, together with a
physiologically acceptable diluent or carrier.

13. The composition of claim 12, further comprising an adjuvant.

14. The composition of claim 12, further comprising a second Helicobacter
polypeptide or a derivative thereof.

15. The composition of claim 14, wherein said second Helicobacter
polypeptide is a Helicobacter urease, a subunit, or a derivative thereof.

16. A method of preventing or treating Helicobacter infection in a mammal,
said method comprising administering to said mammal a prophylactically or
therapeutically effective amount of a polynucleotide of claim 1.

17. A composition comprising a viral vector, in the genome of which is
inserted a DNA molecule of claim 1, said DNA molecule being placed under
conditions for expression in a mammalian cell and said viral vector being admixed
with a physiologically acceptable diluent or carrier.

18. A composition that comprises a bacterial vector comprising a DNA
molecule of claim 1, said DNA molecule being placed under conditions for
expression and said bacterial vector being admixed with a physiologically acceptable
diluent or carrier.

19. The composition of claim 18, wherein said vector is selected from the
group consisting of Shigella, Salmonella, Vibrio cholerae, Lactobacillus, Bacille
bilié de Calmette-Guérin, and Streptococcus.

20. A composition comprising a polynucleotide of claim 1, together with a
physiologically acceptable diluent or carrier.

21. The composition of claim 20, wherein said polynucleotide is a DNA
molecule that is inserted in a plasmid that is unable to replicate and to substantially
integrate in a mammalian genome and is placed under conditions for expression in a
mammalian cell.

22. An expression cassette comprising a DNA molecule of claim 1, said 
DNA molecule being placed under conditions for expression in a procaryotic or 
eucaryotic cell. 

23. A process for producing a compound of claim 4, which comprises 
culturing a procaryotic or eucaryotic cell transformed or transfected with an 
expression cassette of claim 22, and recovering said compound from the cell culture. 

24. A method of preventing or treating Helicobacter infection in a mammal, 
said method comprising administering to said mammal a prophylactically or 
therapeutically effective amount of an antibody that binds to the compound of 
claim 4. 